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(57) Abstract: Methods for altering the N-glycosylation pattern of proteins in higher plants are provided. The methods comprise 
introducing into the plant a recombinant construct that provides for Ihc inhibition of expression of oc],3-fucosyltransfcrase (FucT) 
and ^ 1 ,2-xylosyltransfcrase (XylT) in a plant. Use of these constructs to inhibit or suppress expression of both of these enzymes, 
^ and isoforms thereof, advantageously provides for the production of endogenous and heterologous proteins having a "humanized" 
N-glycosylalion pattern without impacting plant growth and development. Stably transformed higher plants having this protein 
N-glycosylation pattern are provided. Glycoprotein compositions, including monoclonal antibody compositions, having substan- 
tially homogeneous glycosylation profiles, and which are substantially homogeneous for the GO glycoform, are also provided. 



wo 2007/084922 



PCT/US2007/060642 



COMPOSITIONS AND METHODS FOR HUMANIZATION 
AND OPTIMIZATION OF iV-GLYCANS IN PLANTS 



FIELD OF THE INVENTION 
The present invention is related to the field of plant molecular biology, more 
particularly to the field of recombinant mammalian protein production in plants. 

5 BACKGROUND OF THE INVENTION 

A number of plant species have been targeted for use in "molecular farming" 
of mammalian proteins of pharmaceutical interest. These plant expression systems 
provide for low cost production of biologically active mammalian proteins and are 
readily amenable to rapid and economical scale-up (Ma et al, (2003) Nat. Rev, Genet. 

10 4:794-805; Raskin et al. (2002) Trends Biotechnol 20:522-53 1 ). Large numbers of 

mammalian and plant proteins require post-translational processing for proper folding, 
assembly, and function. Of these modifications, the differences in glycosylation 
pattems between plants and mammals offer a challenge to the feasibility of plant 
expression systems to produce high quality recombinant mammalian proteins for 

15 pharmaceutical use. 

As peptides move through the endoplasmic reticulum (ER) and Golgi 
subcellular compartments, sugar residue chains, or glycans, are attached, ultimately 
leading to the formation of glycoproteins. The linkage between the sugar chains and 
the peptide occurs by formation of a chemical bond to only one of four protein amino 

20 acids: asparagine, serine, threonine, and hydroxlysine. Based on this linkage pattern, 

two basic types of sugar residue chains in glycoproteins have been recognized: the A^- 
glycoside-linked sugar chain (also referred to as ^/-linked glycan or AA-glycan), which 
binds to asparagine residues on the peptide; and the O-glycoside-linked sugar chain, 
which binds to serine, threonine, and hydroxylysine residues on the peptide. 

25 The A^-glycoside-linked sugar chains, or A^-glycans, have various structures 

(see, for example, Takahashi, ed. (1989) Biochemical Experimentation Method 23 - 
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Method for Studying Glycoprotein Sugar Chain (Gakujutsu Shuppan Center), but 
share a common oligomannosidic core (see Figure 29A). The initial steps in the 
glycosylation pathway leading to the formation of A^-glycans are conserved in plants 
and animals. However, the final steps involved in complex /V-glycan formation differ 
5 (Lerouge el al. (1998) Plant Mo. Biol 38:31-48; Steinkellner and Strasser (2003) Ann. 

Plant /?ev.9: 181-192). Plants produce glycoproteins with complex A/-glycans having 
a core bearing two N-acetylglucosamine (GlcNAc) residues that is similar to that 
observed in mammals. However, in plant glycoproteins this core is substituted by a 
pl,2-linked xylose residue (core xylose), which residue does not occur in humans, 

10 Lewis" epitopes, and an al,3-linked fticose (core a[ 1 ,3]-fucose) instead of an al,6- 

linked core fucose as in mammals (see, for example, Lerouge et al (1998) Plant Mol 
Biol 38:3 1-48 for a review) (see also Figure 29B). Both the a(l,3)-fucose and P(l,2)- 
xylose residues reportedly are, at least partly, responsible for the immunogenicity of 
plant glycoproteins in mammals (see, for example, Ree et al (2000) J. Biol Chem. 

15 15:1 1451-1 1458; Bardor et al (2003) Clycobiol 13:427-434; Garcia-Casado et al 

(1996) Glycobiol 6:471-477). Therefore removal of these potentially allergenic sugar 
residues from mammalian glycoproteins recombinantly produced in plants would 
overcome concerns about the use of these 
proteins as pharmaceuticals for treatment of humans. 

20 A number of recombinantly produced glycoproteins currently serve as 

therapeutics or are under clinical investigation. Examples include the interferons 
(IFNs), erythropoietin (EPO), tissue plasminogen activator (tPA), antithrombin, 
granulocyte-macrophage colony stimulating factor (GM-CSF), and therapeutic 
monoclonal antibodies (mABs). The oligosaccharide component of the iV-glycan 

25 structures of glycoproteins can influence their therapeutic efficacy, as well as their 

physical stability, resistance to protease attack, pharmacokinetics, interaction with the 
immune system, and specific biological activity. See, for example, Jenkins et al 
(1996) Nature Biotechnol 14:975-981. 

Methods are needed to alter the glycosylation pattern in plant expression 

30 systems, specifically to inhibit plant-specific glycosylation of the eukaryotic core 

structure, to advantageously produce recombinant mammalian proteins with a 
humanized glycosylation pattern. 
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BRIEF SUMMARY OF THE INVENTION 
Methods for altering the A^-glycosylation pattern of proteins in higher plants 
are provided. The methods comprise stably transforming the plant with at least one 
recombinant nucleotide construct that provides for the inhibition of expression of 
5 al,3-fucosyltransferase (FucT) and P 1 ,2-xylosyltransferase (XylT) in a plant. Use of 

these constructs to inhibit or suppress expression of one or both of these enzymes, and 
isoforms thereof, advantageously provides for the production of endogenous and 
heterologous proteins having a "humanized" //-glycosylation pattern without 
impacting plant growth and development. Stably transformed higher plants having 
10 this protein A^-glycosylation pattern are provided. In some embodiments, the plant is 

a crop plant that is a member of the dicots, such as pea, alfalfa, and tobacco; in other 
embodiments, the plant is a crop plant that is a monocot, such as rice or maize. In yet 
other embodiments, the plant is a member of the Lemnaceae family, for example, a 
Lemna sp. 

1 5 The transgenic plants of the invention have the ability to produce recombinant 

manunalian proteins having an ^-glycosylation pattern that is more similar to that of 
the mammalian host. Thus, in some embodiments, the recombinantly produced 
mammalian proteins are glycoproteins comprising complex //-glycans having a 
reduction in the attachment of the plant-specific a(l ,3)-fucose and P(l,2)-xylose 

20 residues. In other embodiments, these recombinantly produced glycoproteins 

comprise complex A/-glycans that are devoid of these plant-specific residues. In yet 
other embodiments, these recombinantly produced glycoproteins have 
GlcNAc2Man3GlcNAc2 as the single glycan species attached to the asparagine 
glycosylation site(s) within the glycoprotein. 

25 In some embodiments, the recombinantly produced glycoprotein is a 

monoclonal antibody. In this manner, the present invention provides transgenic plants 
that are capable of producing glycan-optimized monoclonal antibodies that 
specifically bind a target protein of interest, including monoclonal antibodies having 
increased effector function. Thus, in some embodiments, the recombinantly produced 

30 glycan-optimized monoclonal antibodies comprise complex AT-glycans that have a 

reduction in the attachment of a(l,3)-Iinked fucose residues, thereby increasing 
ADCC activity of these antibodies. In other embodiments, the recombinantly 
produced monoclonal antibodies comprise complex AMinked glycans that are devoid 
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of these plant-specific fucose residues. In this manner, the present invention provides 
for the production of a monoclonal antibody composition, w^herein at least 90% or 
more of the intact antibody is represented by a single glycoform, more particularly, 
the GO glycoform. Thus, in some embodiments of the invention, the recombinantly 
5 produced monoclonal antibodies have increased effector function, wherein the ADCC 

activity is increased and/or the ratio of ADCC/CDC activity is increased. In some of 
these embodiments, the recombinantly produced monoclonal antibodies have 
decreased CDC activity, which can advantageously reduce the potential for adverse 
side effects related to CDC activation upon administration. These glycan-optimized 

10 monoclonal antibodies advantageously can be used to alter current routes of 

administration and current therapeutic regimens, as their increased effector function 
means they can be dosed at lower concentrations and with less frequency, thereby 
reducing the potential for antibody toxicity and/or development of antibody tolerance. 
Furthermore, their improved effector function yields new approaches to treating 

15 clinical indications that have been previously been resistant or refractory to treatment 

with the corresponding monoclonal antibody produced in other recombinant host 
systems. 

Compositions for practicing the methods of the invention are provided. The 
compositions comprise novel isolated polynucleotides and polypeptides encoding a 
20 Lemna minor a 1 ,3-fucosyl transferase and P 1 ,2-xylosyltransferase, and variants and 

fragments thereof. Recombinant nucleotide constructs that target expression of these 
two proteins, or expression of variants thereof, are also provided, as are plant cells, 
plant tissues, plants, and seeds comprising these recombinant constructs. 



25 BRIEF DESCRIPTION OF THE FORMAL DRAWINGS 

Figure 1 sets forth the DNA (SEQ ID NO: 1 ; coding sequence set forth in SEQ 
ID N0:2) and amino acid (SEQ ID N0:3) sequences for the Lemna minor al,3- 
fucosyltransferase (FucT). The coding sequence is shown in bold. Nucleotides 
denoted by the single underline (— ) correspond to the FucT forward fragment within 

30 the RNAi expression cassette designed to inhibit expression of FucT (see Figure 5); 

nucleotides denoted by the double underline (=) correspond to the spacer sequence 
within this RNAi expression cassette. The FucT reverse fragment of the RNAi 
expression cassette is the antisense of the FucT forward fragment shown here. 
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Figure 2 sets forth an alignment of the Lemna minor FucT of SEQ ID N0:3 
with a 1. 3 -fiicosy [transferases from other higher plants. 

Figure 3 sets forth the DNA (SEQ ID N0:4; coding sequence set forth in SEQ 
ID N0:5) sequence for the Lemna minor pi,2-xylosyltransferase (XylT) isoform #1 
5 and the encoded amino acid (SEQ ID N0:6) sequence. Nucleotides denoted by the 

single underline (— ) correspond to the XylT forward fragment within the RNAi 
expression cassette designed to inhibit expression of XylT (see Figure 6); nucleotides 
denoted by the double underline (=) correspond to the spacer sequence within this 
RNAi expression cassette. The XylT reverse fragment of the RNAi expression 
10 cassette is the antisense of the XylT forward fragment shown here. 

Figure 4 sets forth an alignment of the Lemna minor XylT of SEQ ID N0:6 
with pi,2-xylosyltransferases from other higher plants. 

Figure 5 sets forth one strategy for designing a single-gene RNAi knockout of 
Lemna minor FucT. 

15 Figure 6 sets forth one strategy for designing a single-gene RNAi knockout of 

Lemna minor XylT based on the DNA sequence for XylT isoform #1. 

Figure 7 sets forth one strategy for designing a double-gene RNAi knockout of 
Lemna minor FucT and XylT where the XylT portion of the RNAi knockout is based 
on the DNA sequence for XylT isoform #1. 

20 Figure 8 shows the Fuc02 construct comprising an RNAi expression cassette 

designed for single-gene RNAi knockout of Lemna minor FucT. Expression of the 
FucT inhibitory sequence (denoted by FucT forward and FucT reverse arrows; see 
Figure S) is driven by an operably linked expression control element (denoted as 
AocsAocsAocsAmasPmas) comprising three upstream activating sequences (Aocs) 

25 derived from the Agrobacterium tumefaciens octopine synthase gene operably linked 

to a promoter derived from an Agrobacterium tumefaciens mannopine synthase gene 
(AmasPmas). RbcS leader, rubisco small subunit leader sequence; ADHl, intron of 
maize alcohol dehydrogenase 1 gene; nos-ter, Agrobacterium tumefacians nopaline 
synthetase (nos) terminator sequence. 

30 Figure 9 shows the Xyl02 construct comprising an RNAi expression cassette 

designed for single-gene RNAi knockout of Lemna minor XylT, Expression of the 
XylT inhibitory sequence (denoted by XylT forward and XylT reverse arrows; see 
Figure 6) is driven by the operably linked AocsAocsAocsAmasPmas expression 
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control element. RbcS leader, rubisco small subunit leader sequence; ADHl, intron 
of maize alcohol dehydrogenase 1 gene; nosAQv, Agrobacterium tumefacians nopaline 
synthetase (nos) terminator sequence. 

Figure 10 shows the XF02 construct comprising a chimeric RNAi expression 
5 cassette designed for double-gene RNAi knockout oiLemna minor FucT/XylT. The 

hairpin RNA is expressed as a chimeric sequence (a chimeric hairpin RNA), where 
fragments of the two genes are fused together and expressed as one transcript. 
Expression of the FucT/XylT inhibitory sequence (denoted by FucT and XylT 
forward arrows and XylT and FucT reverse arrows; see Figure 7) is driven by the 
10 operably linked AocsAocsAocsAmasPmas expression control element. RbcS leader, 

rubisco small subunit leader sequence; ADHl, intron of maize alcohol dehydrogenase 
1 gene; nos-ter, Agrobacterium tumefacians nopaline synthetase (nos) terminator 
sequence. 

Figure 1 1 shows the XF03 construct comprising an RNAi expression cassette 

1 5 designed for double-gene RNAi knockout of Lemrta minor FucT/XylT. The cassette 

expresses two RNAi hairpins, one targeting expression of FucT, the other targeting 
expression of XylT. Expression of the FucT inhibitory sequence (denoted by FucT 
forward and FucT reverse arrows; see Figure 5) is driven by an operably linked 
expression control element comprising the Lemna minor ubiquitin promoter plus 5' 

20 UTR (LmUbq promoter) and intron (LmUbq intron) (see SEQ ID N0:7). Expression 

of the XylT inhibitory sequence (denoted by XylT forward and XylT reverse arrows; 
see Figure 6) is driven by the operably linked AocsAocsAocsAmasPmas expression 
control element. RbcS leader, rubisco small subunit leader sequence; ADHl, intron of 
maize alcohol dehydrogenase 1 gene; nos-ter, Agrobacterium tumefacians nopaline 

25 synthetase (nos) terminator sequence. 

Figure 12 shows the mAbI04 construct that provides for co-expression of an 
IgGl monoclonal antibody (referred to herein as mAbI) and the double-gene 
knockout of Lemna minor FucT and XylT, wherein a chimeric hairpin RNA targeting 
expression of the FucT and XylT is expressed. Expression of the FucT/XylT 

30 inhibitory sequence (denoted by FucT and XylT forward arrows and XylT and FucT 

reverse arrows; see Figure 7) is driven by an operably linked expression control 
element comprising the Spirodella polyrrhiza ubiquitin promoter plus 5' UTR (SpUbq 
promoter) and intron (SpUbq intron) (see SEQ ID NO:8). Expression of the IgGl 
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light chain is driven by an operably linked expression control element comprising the 
L. minor ubiquitin promoter plus 5' UTR (LmUbq promoter) and intron (LmUbq 
intron). Expression of the IgGl heavy chain is driven by the operably linked 
AocsAocsAocsAmasPmas expression control element. RbcS leader, rubisco small 
S subunit leader sequence; ADH 1 , intron of maize alcohol dehydrogenase 1 gene; nos- 

ter, Agrobacterium tumefacians nopaline synthetase (nos) terminator sequence. 

Figure 13 shows the mAblOS construct that provides for co-expression of 
mAbI and the double-knockout of Lemna minor FucT and XylT, wherein two hairpin 
RNAs are expressed, one targeting expression of FucT, the other targeting expression 

10 of the XylT. Expression of the FucT inhibitory sequence (denoted by FucT forward 

and FucT reverse arrows; see Figure 5) is driven by an operably linked expression 
control element comprising the S. polyrrhiza ubiquitin promoter plus 5' UTR (SpUbq 
promoter) and intron (SpUbq intron). Expression of the XylT inhibitory sequence 
(denoted by XylT forward and XylT reverse arrows; see Figure 6) is driven by an 

15 operably linked expression control element comprising the Lemna aequinoctialis 

ubiquitin promoter plus 5' UTR (LaUbq promoter) and intron (LaUbq intron) (see 
SEQ ID N0:9). Expression of the IgGl light chain is driven by an operably linked 
expression control element comprising the L. minor ubiquitin promoter plus 5' UTR 
(LmUbq promoter) and intron (LmUbq intron). Expression of the IgGl heavy chain 

20 is driven by the operably linked AocsAocsAocsAmasPmas expression control 

element. RbcS leader, rubisco small subunit leader sequence; ADH 1, intron of maize 
alcohol dehydrogenase 1 gene; nos-ter, Agrobacterium tumefacians nopaline 
synthetase (nos) terminator sequence. 

Figure 14 shows the mAblOl construct that provides for expression of mAbI, 

25 where FucT and XylT expression are not suppressed. Expression of the IgGl light 

chain and IgG 1 heavy chain are independently driven by the operably linked 
AocsAocsAocsAmasPmas expression control element RbcS leader, rubisco small 
subunit leader sequence; ADHl, intron of maize alcohol dehydrogenase 1 gene; nos- 
ter, Agrobacterium tumefacians nopaline synthetase (nos) terminator sequence. 

30 mAblOl is referred to as the 'Svild-type" mAblOl construct as the expressed mAbI 

exhibits the glycosylation profile of wild-type L minor. 

Figures 1 5 and 1 6 show the primary screening data for transgenic RN Ai L 
minor plant lines comprising the XF02 construct of Figure 10, 
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Figure 17 shows primary screening data for transgenic RNAi L, minor plant 
lines comprising the mAbI04 construct of Figure 12 and mAblOS construct of Figure 
13. 

Figure 18 shows the structure and molecular weight of derivatized wild-type 
5 L minor mAb A^-glycans. "GnGn" represents the GlcNAc2Man3GlcNAc2 A^-glycan 

species, also referred to as a GO A^-glycan species. "GnGnX" represents the 
GlcNAc2Man3GlcNAc2 A^-glycan species with the plant-specific P(l,2)-xylose residue 
attached. "GnGnXF" represents the GlcNAc2Man3GlcNAc2 A^-glycan species with 
the plant-specific p(l,2)-xylose residue and plant-specific a(l,3)-fucose residue 

10 attached. 

Figure 19 shows that the wild-type mAblOl construct (shown in Figure 15) 
providing for expression of the mAbI monoclonal IgGl antibody in L minor, without 
RNAi suppression of L. minor FucT and XylT, produces an N-glycosylation profile 
with three major N-glycan species, including one species having the pl,2-linked 

15 xylose and one species having both the pi,2-linked xylose and core a 1,3 -linked 

fucose residues; this profile is confirmed with liquid chromatography mass 
spectrometry (LC-MS) (Figure 20) and MALDI (Figure 21) analysis. 

Figure 22 shows an overlay of the relative amounts of the various A^-glycan 
species of mAbI produced in the wild-type L minor line comprising the mAblOl 

20 construct (no suppression of FucT or XylT) and in the two transgenic L minor lines 

comprising the mAbI04 construct of Figure 12 (providing for coexpression of mAbI 
and the chimeric RNAi construct targeting both L minor FucT and XIT). Note the 
enrichment of the GnGn (i.e., GO) glycan species, with no pi,2-linked xylose or core 
al,3-linked fiicose residues attached, and the absence of the species having the pi,2- 

25 linked xylose or both the pi,2-linked xylose and core al,3-linked fucose residues, 

"MGn" represents an A'^glycan precursor, wherein the trimannose core structure, 
Man3GlcNAc2, has one N-acetylglucosamine attached to the 1,3 mannose arm. 
"GnM" represents an A^-glycan precursor, wherein the trimannose core structure, 
Man3GlcNAc2, has one N-acetylglucosamine attached to the 1,6 mannose arm. These 

30 N-glycan precursors represent a trace amount of the total A^glycans present in the 

sample. This profile is confirmed with mass spec (LC-MS) (Figure 23) and MALDI 
(Figure 24) analysis. 
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Figure 25 shows intact mass analysis of mAbI compositions produced in wild- 
type L minor (line 20) comprising the mAblOl construct. When XylT and FucT 
expression are not suppressed in L minor, the recombinantly produced mAbI 
composition is heterogeneous, comprising at least 9 different glycoforms, with the 
5 GOXF^ glycoform being the predominate species present. Note the very minor peak 

representing the GO glycoform. 

Figure 26 shows intact mass analysis of mAbI compositions produced in 
transgenic L minor (line 1 5) comprising the mAbI04 construct of Figure 12. When 
XylT and FucT expression are suppressed in L. minor using this chimeric RNAi 

10 construct, the intact mAbI composition is substantially homogeneous for GO A^- 

glycans, with only trace amounts of precursor A^-glycans present (represented by the 
GnM and MGn precursor glycan species). In addition, the mAbI composition is 
substantially homogeneous for the GO glycoform, wherein both glycosylation sites are 
occupied by the GO iV-glycan species, with three minor peaks reflecting trace amounts 

1 5 of precursor glycoforms (one peak showing mAbI having an Fc region wherein the 

Ch2 domain of one heavy chain has a GO glycan species attached to Asn 297, and the 
Ch2 domain of the other heavy chain is unglycosylated; another peak showing mAbI 
having an Fc region wherein the Ch2 domain of one heavy chain has a GO glycan 
species attached to Asn 297, and the Ch2 domain of the other heavy chain has the 

20 GnM or MGn precursor glycan attached to Asn 297; and another peak showing mAbI 

having an Fc region wherein the Asn 297 glycosylation site on each of the Ch2 
domains has a GO glycan species attached, with a third GO glycan species attached to 
an additional glycosylation site within the mAbI structure). 

Figure 27 shows intact mass analysis of the mAbI compositions produced in 

25 transgenic L minor (line 72) comprising the mAbI05 construct of Figure 13. When 

XylT and FucT expression are suppressed in L. minor using this construct, the intact 
mAbl composition is substantially homogeneous for GO iV-glycans, with only trace 
amounts of precursor iV-glycan species present (represented by the GnM and MGn 
precursor glycan species). In addition, the mAbl composition is substantially 

30 homogeneous (at least 90%) for the GO glycoform, with the same three minor peaks 

reflecting precursor glycoforms as obtained with the mAbI04 construct. 

Figure 28 summarizes two possible designs for targeting expression of 
individual FucT and XylT genes. 



-9- 



wo 2007/084922 



PCTAJS2007/060642 



Figure 29A shows the common oligomannosidic core structure of complex N- 
glycans of glycoproteins produced in plants and animals. In mammals, the core 
structure can include a fucose residue in which 1 -position of the fucose is bound to 6- 
position of the N-acetylgucosamine in the reducing end through an a bond (i.e., 
5 a(l,6)-linked fucose). Figure 29B shows the plant-specific modifications to these N- 

glycans. The mammalian R groups can be one of the following: (a) 
R=GlcNAcP(l,2); (b) R=Gaip(l,4)-GlcNAcp(l,2); (c) R=NeuAca(2,3)-Gaip(l,4)- 
GlcNAcP(l,2); (d) R=NeuGca(2,3>Gaip(1.4)-GlcNAcP(l,2); and (e) R=Gala(l,3)- 
Gaip(l,4)-GlcNAcP(l,2). The plant R groups can be one of the following: (a) 
10 R=null; (b) R=GlcNAcp(l,2); 

(c), R=Gaip(l,3)-GlcNAcP(l,2) and (d) R=Gaip(l,4>GlcNAcP(l,2). 

I I 
Fuca(l,4) Fuca(l,6). 
Abbreviations: Man, mannose; GlcNAc, N-acetylglucosamine; Xyl, xylose; Fuc, 
1 5 fucose; Gal, galactose; NeuAc (neuraminic acid (sialic acid); *, reducing end of sugar 

chain that binds to asparagine. 

Figure 30 shows the GO, GOX, and GOXF"' species of A^-linked glycans of 
glycoproteins referred to in the description and claims of the present invention, along 
with the alternate nomenclature used herein. 
20 Figure 3 1 sets forth the partial cDNA (SEQ ID NO: 19; coding sequence set 

forth in SEQ ID NO:20) sequence for the Lemna /w/war pi,2-xylosyltransferase 
(XylT) isoform #2, and partial amino acid (SEQ ID N0:21) sequence encoded 
thereby. Nucleotides denoted by the single underline (— ) correspond to the XylT 
forward fragment within the RNAi expression cassette designed to inhibit expression 
25 of XylT (see Figure 33); nucleotides denoted by the double underline (=) correspond 

to the spacer sequence within this RNAi expression cassette. The XylT reverse 
fragment of the RNAi expression cassette is the antisense of the XylT forward 
fragment shown here. 

Figure 32 sets forth an alignment of the Lemna minor XylT isoform #1 of SEQ 
30 ID N0:6 with the Lemna minor partial-length XylT isoform #2 of SEQ ID N0:2 1. 

Figure 33 sets forth one strategy for designing a single-gene RNAi knockout 
of Lemna minor XylT based on the partial DNA sequence for XylT isoform #2. 
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Figure 34 sets forth one strategy for designing a double-gene RNAi knockout 
of Lemna minor FucT and XylT, where the XylT portion of the RNAi knockout is 
based on the partial DNA sequence for XylT isoform #2. 

Figure 35 shows receptor binding activity of the CHO-derived and SP2/0- 
5 derived niAbl product for the FcyRIIIa on freshly isolated human NK cells. 

Figure 36 shows receptor binding activity of the wild-type Lemna-dQrivQd 
mAbI product and the transgenic Lemna-dQuwed mAbI product for the FcyRIIIa on 
freshly isolated human NK cells collected from Donor 1 . 

Figure 37 shows receptor binding activity of the wild-type Zemna-derived 
10 mAbl product and the transgenic Z^m«a-derived mAbl product for the FcyRIIIa on 

freshly isolated human NK cells collected from Donors 2 and 3. 

Figure 38 shows receptor binding activity of the Sp2/0-derived mAbl product, 
the wild-type Z^mrta-derived mAbl product, and the transgenic Lemna-dQnved mAbl 
product for the mouse FcyRIV. 
1 5 Figure 39 shows a diagram of the MDXA04 binary expression vector for 

RNAi silencing of FucT and XylT activity in Lemna. Hatched regions show the 
position of the heavy (H) and light (L) chain variable region gene sequences of fully 
human mAbl kappa antibody MDX-060 and the chimeric hairpin RNA (ElNAi) 
designed to target silencing of endogenous Lemna genes encoding FucT and XylT. 
20 Promoters: PI, P2, and P3; terminator: T; selectable marker: SM; left border LB; 

right border, RB. The MDXAOl expression vector used to express the MDX-060 
mAb in wild-type Lemna did not contain the hairpin RNA region. 

Figure 40 shows glycosyltransferase activity in Lemna wild-type and MDX- 
060 LEX^* RNAi lines. Microsomal membranes from wild-type (WT) and MDX- 
25 060 LEX^* RNAi (line numbers are indicated) plants were incubated in the presence 

of a reaction buffer containing GDP-Fuc, UDP-Xyl and GnGn-dabsyl-peptide 
acceptor. Mass peaks corresponding to fucosylated (white bars) or xylosylated (black 
bars) products synthesized by microsomes from each line were measured by positive 
reflectron mode MALDI-TOF MS and normalized, in percent, to the WT positive 
30 control. Boiled wildtype membranes (BWT) indicate background ion counts. 

Figure 41 shows SDS-PAGE of plant extracts and protein A or hydroxyapatite 
purified samples from MDX-060 LEX^^ under non-reducing (Figure 41 A) and 
reducing (Figure 4 IB) conditions, respectively. MAb pimfied from a CHO cell line 
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(MDX-060 CHO) was used as a positive control Mark 12 molecular weight markers 
were included on the gels. Gels were stained with Colloidal Blue. 

Figure 42 shows the spectra obtained from negative, reflectron mode MALDI- 
TOF mass spectrometric analysis of 2-A A labeled //-glycans released from MDX-060 
mAbs expressed in CHO (MDX-060 CHO). wild-type Lemna (MDX-060 LEX), or 
Lemna transformed with the XylT/FucT RNAi construct (MDX-060 LEX^^). 
Significant peaks are identified by the corresponding mass ([M-H]'). The * indicates 
the location of matrix artefacts. 

Figure 43 shows the spectra obtained from NP-HPLC-QTOF MS analysis of 
2-AA labeled A^-glycans released from MDX-060 mAbs expressed in CHO (MDX- 
060 CHO), wild-type Lemna (MDX-060 LEX), or Lemna transformed with the 
XylT/FucT RNAi construct (MDX-060 LEX^'). 2-AA labeled iV-glycans were 
separated by normal phase chromatography and detected by fluorescence. The most 
abundant peaks from each sample (labeled a-i) were characterized by on-line negative 
mode QTOF MS and their corresponding QTOF mass spectra ([M-2H]^ ) are shown. 

Figure 44 shows in vitro activity of MDX-060 mAbs as measured by flow 
cytometric analysis of MDX-060 CHO, LEX, or glyco-optimized LEX^' mAb 
binding to CD30 expressed on L540 cells. L540 cells were incubated with increasing 
concentrations of the indicated antibody as outlined in Example 6 herein below. Geo 
Mean Fluorescence Intensity (GMFI) is plotted against the various concentrations of 
mAb used. ■: MDX-060 CHO; ▲: MDX-060 LEX;V: MDX-060 LEX"^'. 

Figure 45 shows equilibrium binding of glyco-optimized and wild-type mAb 
to two different human FcRyllla allotypes (Val^^^ or Phe^^^), The binding signal as a 
function of FcRyllla was fit to a one-site binding model ■: MDX-060 CHO; ▲: 
MDX-060 LEX;T: MDX-060 LEX°P^ 

Figure 46 shows ADCC activity of MDX-060 niAb derived from CHO, LEX 
(wild-type Lemna glycosylation), or LEX^^^ (RNAi transgenic Lemna), Human 
effector cells from a FcyRIIIaPhe^^^ homozygote donor and a FcyRHIaPheA^al^^^ 
heterozygote donor were incubated with BATDA-labeled L540 cells at an 
effector:target ratio of 50: 1 in the presence of increasing concentrations of the 
indicated antibodies. Specific percent lysis at each mAb concentration is plotted. 
Human mAbl not recognizing antigen on L540 cells was used as an isotype control in 
all experiments. EC50 values (jig/mL), binding constants and maximal percent lysis 
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were calculated using GraphPad Prism 3.0 software. ■: MDX-060 CHO; ▲: MDX- 
060 LEX;T: MDX-060 LEX*^. 

Figure 47 shows intact mass analysis of the MDX-060 LEX mAb 
compositions produced in wild-type L minor comprising the MDXAOl construct 
5 When XylT and FucT expression are not suppressed in L minor, the recombinantly 

produced MDX-060 LEX mAb composition comprises at least 7 different 

•J 

glycofonns, with the GOXF glycoform being the predominate species present. Note 
the absence of a peak representing the GO glycoform. 

Figure 48 shows glycan mass analysis of the heavy chain of the MDX-060 

10 LEX mAb produced in wild-type L minor comprising the MDXAOl construct. When 

XylT and FucT expression are not suppressed in L minor, the predominate A^-glycan 
species present is G0XF\ with additional major peaks reflecting the GOX species. 
Note the minor presence of the GO glycan species. 

Figure 49 shows intact mass analysis of the MDX-060 LEX*^^ mAb 

1 5 compositions produced in transgenic L minor comprising the MDXA04 construct. 

When XylT and FucT expression are suppressed in L minor, the intact mAb 
composition contains only GO A^-glycans. In addition, the composition is substantially 
homogeneous for the GO glycoform (peak 2), wherein both glycosylation sites are 
occupied by the GO A^-glycan species, with two minor peaks reflecting trace amounts 

20 of precursor glycoforms (peak 1 , showing mAb having an Fc region wherein the Ch2 

domain of one heavy chain has a GO glycan species attached to Asn 297, and the Ch2 
domain of the other heavy chain is unglycosylated; and peak 3, showing mAb having 
an Fc region wherein the Asn 297 glycosylation site on each of the Ch2 domains has a 
GO glycan species attached, with a third GO glycan species attached to an additional 

25 glycosylation site within the mAb structure). 

Figure 50 shows glycan mass analysis of the heavy chain of the MDX-060 
LEX^^ mAb produced in transgenic L minor comprising the MDXA04 construct. 
When XylT and FucT expression are suppressed in L minor, the only readily 
detectable //-glycan species attached to the Asn 297 glycosylation sites of the Ch2 

30 domains of the heavy chains is GO. 

Figures 51 A (MALDl analysis) and 5 IB (HPLC analysis) show that the 
homogeneous glycosylation profile exhibited by mAbI produced in transgenic L 
minor (line 24) comprising the mAbI04 RNAi construct was consistently observed 
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with scaled-up production. This glycosylation profile was consistent over the 8- 
month period of continuous maintenance of the transgenic line via clonal expansion. 

Figure 52 shows that suppression of FucT and XylT expression using the 
chimeric RNAi mAbI04 construct of Figure 12 results in endogenous glycoproteins 
5 having a homogeneous glycosylation pattern consistent with that observed for 

recombinant glycoproteins. For this figure, the pi,2-linked xylose residue attached to 
the trimannose core structure is designated by the star symbol. 

Figure 53 shows the structure of complex A^-glycans described in Example 6 
below. M = mannose; Gn = A^-acetylglucosamine; A = galactose; X = xylose; F = 
10 fucose. 



DETAILED DESCRIPTION OF THE INVENTION 
The present inventions now will be described more fully hereinafter with 
reference to the accompanying drawings, in which some, but not all embodiments of 

15 the inventions are shown. Indeed, these inventions may be embodied in many 

different forms and should not be construed as limited to the embodiments set forth 
herein; rather, these embodiments are provided so that this disclosure will satisfy 
applicable legal requirements. Like numbers refer to like elements throughout 

Many modifications and other embodiments of the inventions set forth herein 

20 will come to mind to one skilled in the art to which these inventions pertain having 

the benefit of the teachings presented in the foregoing descriptions and the associated 
drawings. Therefore, it is to be understood that the inventions are not to be limited to 
the specific embodiments disclosed and that modifications and other embodiments are 
intended to be included within the scope of the appended claims. Although specific 

25 terms are employed herein, they are used in a generic and descriptive sense only and 

not for purposes of limitation. 

The present invention provides compositions and methods for altering the N- 
glycosylation pattern of homologous and heterologous polypeptides produced in a 
plant, particularly a plant that serves as an expression system for recombinant proteins 

30 of interest. The methods comprise the use of nucleotide constructs comprising one or 

more sequences that are capable of inhibiting expression of al,3-fucosyltransferase 
(FucT) and pi ,2-xylosyltransferase (XylT) in a plant. Compositions for use in 
practicing the methods of the invention, and compositions comprising glycoproteins 
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having a substantially homologous glycosylation profile, including glycan-optimized 
monoclonal antibodies with improved effector function, are also provided. 

Definitions: 

5 "Polypeptide" refers to any monomeric or multimeric protein or peptide. 

"Biologically active polypeptide" refers to a polypeptide that has the 
capability of performing one or more biological functions or a set of activities 
normally attributed to the polypeptide in a biological context. Those skilled in the art 
will appreciate that the term "biologically active" includes polypeptides in which the 

10 biological activity is altered as compared with the native protein (e.g., suppressed or 

enhanced), as long as the protein has sufficient activity to be of interest for use in 
industrial or chemical processes or as a therapeutic, vaccine, or diagnostics reagent. 
Biological activity can be determined by any method available in the art. For 
example, the biological activity of members of the interferon family of proteins can be 

15 determined by any of a number of methods including their interaction with interferon- 

specific antibodies, their ability to increase resistance to viral infection, or their ability 
to modulate the transcription of interferon-regulated gene targets. In like manner, 
biological activity of monoclonal antibodies can be determined by any of a number of 
methods including, but not limited to, assays for measuring binding specificity and 

20 effector function, for example, using assays for antibody-dependent cellular 

cytotoxicity (ADCC) and complement-dependent cytotoxicity (CDC) activity. 

By "host cell" is intended a cell that comprises a heterologous nucleic acid 
sequence of the invention. Though the nucleic acid sequences of the invention, and 
fragments and variants thereof, can be introduced into any cell of interest, of 

25 particular interest are plant host cells. In some embodiments, the plant host cells are 

cells of a plant that serves as a host for expression of recombinant proteins, for 
example, a plant expression system for production of recombinant mammalian 
proteins of interest as noted herein below. 

By "heterologous polypeptide of interest" is intended a polypeptide that is not 

30 expressed by the host cells in nature. Conversely, a "homologous polypeptide" is 

intended a polypeptide that is naturally produced within the cells of the host. 
Heterologous and homologous polypeptides that undergo post-translational A^- 
glycosylation are referred to herein as heterologous or homologous glycoproteins. In 
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accordance with the methods of the present invention, the //-glycosylation pattern of 
both heterologous and homologous glycoproteins is altered within the cells of a plant 
host so that these glycoproteins have an A'-glycosylation pattern that is more similar to 
that observed with mammalian hosts. 
5 Exemplary heterologous polypeptide and heterologous nucleotide sequences 

of interest include, but are not limited to, sequences that encode mammalian 
polypeptides, such as insulin, growth hormone, a-interferon, |J-interferon, P- 
glucocerebrosidase, P-glucoronidase, retinoblastoma protein, p53 protein, angiostatin, 
leptin, erythropoietin (EPO), granulocyte macrophage colony stimulating factor, 

10 plasminogen, tissue plasminogen activator, blood coagulation factors, for example, 

Factor vn, Factor Vni, Factor DC, and activated protein C, alpha 1 -antitrypsin, 
monoclonal antibodies (mAbs), Fab fragments, single-chain antibodies, cytokines, 
receptors, hormones, human vaccines, animal vaccines, peptides, and serum albumin. 
For purposes of the present invention, the terms 'W-glycan," 'W-linked 

15 glycan," and "glycan" are used interchangeably and refer to an AMinked 

oligosaccharide, e.g., one that is or was attached by an N-acetylglucosamine 
(GlcNAc) residue linked to the amide nitrogen of an asparagine residue in a protein. 
The predominant sugars found on glycoproteins are glucose, galactose, mannose, 
fucose, N-acetylgalactosamine (GalNAc), N-acetylglucosamine (GlcNAc), and sialic 

20 acid (e.g., N-acetyl-neuraminic acid (NeuAc)). The processing of the sugar groups 

occurs cotranslationally in the lumen of the ER and continues in the Golgi apparatus 
for ^-linked glycoproteins. 

By "oligomannosidic core structure" or "trimannose core structure" of a 
complex A^-glycan is intended the core structure shown in Figure 29A, wherein the 

25 core comprises three mannose (Man) and two N-acetylglucosamine (GlcNAc) 

monosaccharide residues that are attached to the asparagine residue of the 
glycoprotein. The asparagine residue is generally within the conserved peptide 
sequence Asn-Xxx-Thr or Asn-Xxx-Ser, where Xxx is any residue except proline, 
aspartate, or glutamate. Subsequent glycosylation steps yield the final complex N- 

30 glycan structure. The trimannose core structure is denoted herein as "Man3GlcNAc2." 

The A^-glycans attached to glycoproteins differ with respect to the number of 
branches (antennae) comprising peripheral sugars (e.g., GlcNAc, galactose, fucose, 
and sialic acid) that are added to the trimannose core structure, //-glycans are 
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commonly classified according to their branched constituents (e.g., complex, high 
mannose, or hybrid). A "complex" type A^-glycan typically has at least one GlcNAc 
attached to the 1,3 mannose arm and at least one GlcNAc attached to the 1,6 mannose 
arm of a *'trimannose" core. Where one GlcNAc is attached to each mannose arm, the 
5 species of A^-linked glycan is denoted herein as "GlcNAcaManaGlcNAcz" or "GnGn." 

Where only one GlcNac is attached, the //-glycan species is denoted herein as 
"GlcNAc iMan3GlcNAc2", wherein the GlcNac is attached to either the 1,3 mannose 
arm (denoted "MGn" herein) or the 1,6 mannose arm (denoted "GnM" herein) (see 
Figure 30). Complex A^glycans may also have galactose ("Gal") or N- 

10 acetylgalactosamine ("GalNAc") sugar residues that are optionally modified with 

sialic acid or derivatives (e.g., "NeuAc," where '*Neu" refers to neuraminic acid and 
"Ac" refers to acetyl). Where a galactose sugar residue is attached to each GlcNAc 
on each mannose arm, the species of A/-linked glycan is denoted herein as 
"Gal2GlcNAc2Man3GlcNAc2." Complex iV-glycans may also have intrachain 

1 5 substitutions comprising "bisecting" GlcNAc and core fiicose ("Fuc"). Complex A^- 

glycans may also have multiple antennae on the "trimannose core," often referred to 
as "multiple antennary glycans." A "high mannose" type N-glycan has five or more 
mannose residues. A "hybrid" iV-glycan has at least one GlcNAc on the terminal of 
the 1,3 mannose arm of the trimannose core and zero or more mannoses on the 1,6 

20 mannose arm of the trimannose core. 

The terms "GO glycan" and "GO glycan structure" and "GO glycan species" are 
used interchangeably and are intended to mean the complex A^-linked glycan having 
the GlcNAc2Man3GlcNAc2 structure, wherein no terminal sialic acids (NeuAcs) or 
terminal galactose (Gal) sugar residues are present. If a GO glycan comprises a fucose 

25 ("Fuc") residue attached to the trimannose core structure, it is referred to herein as a 

"GOF^ glycan" (having the plant-specific al,3-linked fucose residue) or "GOF^ 
glycan" (having the mammalian al,6-linked fucose residue). In plants, a GO glycan 
comprising the plant-specific |31,2-hnked xylose residue attached to the trimannose 
core structure is referred to herein as a "GOX glycan," and a GO glycan comprising 

30 both the plant-specific pi ,2-linked xylose residue and plant-specific al,3-linked 

fucose residue attached to the trimannose core structure is referred to herein as a 
"GOXF^ glycan." 
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The terms "Gl glycan" and "Gl glycan structure" and "Gl glycan species" are 
used interchangeably and are intended to mean the complex //-linked glycan having 
the GlcNAcaManaGlcNAca structure, wherein one terminal galactose (Gal) residue is 
attached to either the 1,3 mannose or 1,6 mannose arm, and no terminal sialic acids 
5 are present. The terms "G2 glycan" and "G2 glycan structure" and "G2 glycan 

species" are used interchangeably and are intended to mean the complex A^-linked 
glycan having the GlcNAc2Man3GlcNAc2 structure, wherein a terminal galactose 
(Gal) residue is attached to the 1,3 mannose arm and the 1,6 mannose arm, and no 
terminal sialic acids are present 

10 The term "glycoform" as used herein refers to a glycoprotein containing a 

particular carbohydrate structure or structures. Thus, for example, a "GO glycoform" 
refers to a glycoprotein that comprises only GO glycan species attached to its 
glycosylation sites. It is recognized that a glycoprotein having more than one 
glycosylation site can have the same glycan species attached to each glycosylation 

1 5 site, or can have different glycan species attached to different glycosylation sites. In 

this manner, different patterns of glycan attachment yield different glycoforms of a 
glycoprotein. 

The term "glycosylation profile" is intended to mean the characteristic 
"fingerprint" of the representative A^-glycan species that have been released from a 
20 glycoprotein composition or glycoprotein product, either enzymatically or chemically, 

and then analyzed for their carbohydrate structure, for example, using LC-HPLC, or 
MALDI-TOF MS, and the like. See, for example, the review in Current Analytical 
Chemistry, Vol. 1, No. 1 (2005), pp. 28-57; herein incorporated by reference in its 
entirety. 

25 The terms "substantially homogeneous," "substantially uniform," and 

"substantial homogeneity" in the context of a glycosylation profile for a glycoprotein 
composition or glycoprotein product are used interchangeably and are intended to 
mean a glycosylation profile wherein at least 80%, at least 85%, at least 90%, 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the total A^-glycan species 

30 within the profile are represented by one desired A^-glycan species, with a trace 

amount of precursor //-glycan species appearing in the profile. By "trace amount" is 
intended that any given precursor A^-glycan species that is present in the glycosylation 
profile is present at less than 5%, preferably less than 4%, less than 3%, less than 2%, 
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less than 1%, and even less than 0.5% or even less than 0. 1% of the total amount of 
A^-glycan species appearing in the profile. By "precursor" A/^-glycan species is 
intended an ^-glycan species that is incompletely processed. Examples of precursor 
A^-glycan species present in trace amounts in the glycoprotein compositions or 
5 glycoprotein products of the invention, and thus appearing in the glycosylation 

profiles thereof, are the Man3GlcNAc2, MGn (GlcNaclMan3GlcNAc2 wherein 
GlcNacl is attached to the 1,3 mannose arm), and GnM (GlcNaclMan3GlcNAc2 
wherein GlcNacl is attached to the 1 ,6 mannose arm) precursor A/-glycan species 
described above. 

10 Thus, for example, where the desired A^-glycan species within a glycoprotein 

product or composition is GO, a substantially homogeneous glycosylation profile for 

that product or composition would be one wherein at least 80%, at least 85%, at least 

90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the total 

amount of A^-glycan species appearing in the glycosylation profile for the product or 

IS composition is represented by the GO glycan species, with a trace amount of precursor 

* 

A^-glycan species appearing in the glycosylation profile. For such a composition, a 
representative precursor //-glycan species appearing in its glycosylation profile would 
be the Man3GlcNAc2, MGn (GlcNacl Man3GlcNAc2 wherein GlcNacl is attached to 
the 1,3 mannose arm), and GnM (GlcNacl Man3GlcNAc2 wherein GlcNacl is 

20 attached to the 1,6 mannose arm) precursor //-glycan species described above. 

The terms "substantially homogeneous," "substantially uniform," and 
"substantial homogeneity" in the context of a glycoprotein composition or 
glycoprotein product are used interchangeably and are intended to mean the 
glycoprotein product or glycoprotein composition wherein at least 80%, at least 85%, 

25 at least 90%, 9 1 %, 92%, 93%, 94%, 95%. 96%, 97%, 98%, or at least 99% of the 

glycoprotein present in the product or composition is represented by one desired 
glycoform, with a trace amount of precursor or undesired glycoforms being present in 
the composition. By "trace amount" is intended that any given precursor or undesired 
glycoform that is present in the glycoprotein product or composition is present at less 

30 than 5%, preferably less than 4%, less than 3%, less than 2%, less than 1%, and even 

less than 0.5% or even less than 0, 1% of the total glycoprotein. By "precursor" 
glycoform is intended a glycoform wherein at least one glycosylation site is either 
unglycosylated, or is occupied by an A^-glycan species that represents a precursor of 
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the desired A^-glycan species, or a glycoform wherein one or more additional 
glycosylation sites is present, relative to the desired glycoform, and is occupied by 
(i.e., has attached thereto) the desired A^-glycan species or an undesired A/^glycan 
species. 

5 Thus, for example, a substantially homogeneous glycoprotein composition or 

product comprising the GO glycoform is a composition or product wherein at least 
80%, 80%, at least 85%, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 
or at least 99% of the glycoprotein present in the product or composition is 
represented by the GO glycoform, wherein all anticipated glycosylation sites are 

1 0 occupied by the GO glycan species, with a trace amount of precursor or undesired 

glycoforms being present in the composition. In such a composition, a representative 
precursor glycoform would be one in which glycosylation sites are unoccupied, and 
an exemplary undesired glycoform would be a glycoform having a mixture of GO 
glycan and GOX or G0XF3 glycan species attached to its glycosylation sites. 

1 5 The term ''antibody'' is used in the broadest sense and covers fully assembled 

antibodies, antibody fragments that can bind antigen (e.g.. Fab', FXab)2, Fv, single 
chain antibodies, diabodies), and recombinant peptides comprising the foregoing. 
Antibodies represent one of the many glycoproteins contemplated by the methods and 
compositions of the present invention. Derivatives of antibodies are also 

20 contemplated by the present invention. Derivatives include fusion proteins 

comprising an immunoglobulin or portion thereof, such as an Fc region having a Ch2 
domain. 

The term "monoclonal antibody" as used herein refers to an antibody obtained 
from a population of substantially homogeneous antibodies, i.e., the individual 

25 antibodies comprising the population are identical except for possible naturally 

occurring mutations that may be present in minor amounts. 

"Native antibodies" and "native immunoglobulins" are usually 
heterotetrameric glycoproteins of about 150,000 daltons, composed of two identical 
light (L) chains and two identical heavy (H) chains. Each light chain is linked to a 

30 heavy chain by one covalent disulfide bond, while the number of disulfide linkages 

varies among the heavy chains of different immimoglobulin isotypes. Each heavy and 
light chain also has regularly spaced intrachain disulfide bridges. Each heavy chain 
has at one end a variable domain (Vh) followed by a number of constant domains. 
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Each light chain has a variable domain at one end (Vl) and a constant domain at its 
other end; the constant domain of the light chain is aligned with the first constant 
domain of the heavy chain, and the light chain variable domain is aligned with the 
variable domain of the heavy chain. Particular amino acid residues are believed to 
5 form an interface between the light- and heavy-chain variable domains. 

The term "variable" refers to the fact that certain portions of the variable 
domains differ extensively in sequence among antibodies and are used in the binding 
and specificity of each particular antibody for its particular antigen. However, the 
variability is not evenly distributed throughout the variable domains of antibodies. It 

10 is concentrated in three segments called complementarity determining regions (CDRs) 

or hypervariable regions both in the light-chain and the heavy-chain variable domains. 
The more highly conserved portions of variable domains are called the framework 
(FR) regions. The variable domains of native heavy and light chains each comprise 
four FR regions, largely adopting a p-sheet configuration, connected by three CDRs, 

IS which form loops connecting, and in some cases forming part of, the P-sheet structure. 

The CDRs in each chain are held together in close proximity by the FR regions and, 
with the CDRs from the other chain, contribute to the formation of the antigen- 
binding site of antibodies (see Kabat et al {\99\) NIH PubL No. 91-3242, Vol. I, 
pages 647-669). 

20 The constant domains are not involved directly in binding an antibody to an 

antigen, but exhibit various effecter functions, such as Fc receptor (FcR) binding, 
participation of the antibody in antibody-dependent cellular cytotoxicity (ADCC), 
opsonization, initiation of complement-dependent cytotoxicity (CDC activity), and 
mast cell degranulation. 

25 "Antibody fragments" comprise a portion of an intact antibody, preferably the 

antigen-binding or variable region of the intact antibody. Examples of antibody 
fragments include Fab, Fab', F(ab')2, and Fv fragments; diabodies; linear antibodies 
(Zapata et al (1995) Protein Eng. 8(10): 1057-1062); single-chain antibody 
molecules; and multispecific antibodies formed from antibody fragments. Papain 

30 digestion of antibodies produces two identical antigen-binding fragments, called 

"Fab" fragments, each with a single antigen-binding site, and a residual "Fc" 
fragment, whose name reflects its ability to crystallize readily. Pepsin treatment 
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yields an F(ab')2 fragment that has two antigen-combining sites and is still capable of 
cross-linking antigen. 

"Fv" is the minimum antibody fragment that contains a complete antigen 
recognition and binding site. In a two-chain Fv species, this region consists of a 
5 dimer of one heavy- and one light-chain variable domain in tight, non-covalent 

association. In a single-chain Fv species, one heavy- and one light-chain variable 
domain can be covalently linked by flexible peptide linker such that the light and 
heavy chains can associate in a "dimeric*' structure analogous to that in a two-chain 
Fv species. It is in this configuration that the three CDRs of each variable domain 

10 interact to define an antigen-binding site on the surface of the Vh-Vl dimer. 

Collectively, the six CDRs confer antigen-binding specificity to the antibody. 
However, even a single variable domain (or half of an Fv comprising only three 
CDRs specific for an antigen) has the ability to recognize and bind antigen, although 
at a lower affinity than the entire binding site. 

IS The Fab fragment also contains the constant domain of the light chain and the 

first constant domain (ChI) of the heavy chain. Fab fragments differ from Fab' 
fragments by the addition of a few residues at the carboxy terminus of the heavy- 
chain ChI domain including one or more cysteines from the antibody hinge region. 
Fab'-SH is the designation herein for Fab* in which the cysteine residue(s) of the 

20 constant domains bear a free thiol group. F(ab')2 antibody fragments originally were 

produced as pairs of Fab' fragments that have hinge cysteines between them. Other 
chemical couplings of antibody fragments are also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate 
species can be assigned to one of two clearly distinct types, called kappa (k) and 

25 lambda (k), based on the amino acid sequences of their constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy 
chains, immunoglobulins can be assigned to different classes. There are five major 
classes of human immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these 
may be fiirther divided into subclasses (isotypes), e.g., IgGl, IgG2, IgG3, IgG4, IgA, 

30 and IgA2. The heavy-chain constant domains that correspond to the different classes 

of immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively. 
The subunit structures and three-dimensional configurations of different classes of 
immunoglobulins are well known. Different isotypes have different effector 
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functions. For example, human IgGl and IgG3 isotypes mediate antibody-dependent 
cell-mediated cytotoxicity (ADCC) activity. 

Immunoglobulins have conserved AT-linked glycosylation of the Fc region of 
each of the two heavy chains. Thus, for example, immunoglobulins of the IgG type 
5 have glycosylated Ch2 domains bearing N-linked oligosaccharides at asparagine 297 

(Asn-297). Different glycoforms of inununoglobulins exist depending upon the 
particular //-glycan species attached to each of these two glycosylation sites, and 
depending upon the degree to which both sites are glycosylated within an 
immunoglobulin composition. 

10 "Nucleotide sequence of interest*' as used herein with reference to expression 

of heterologous polyeptides refers to any polynucleotide sequence encoding a 
heterologous polypeptide intended for expression in a host, particularly a plant host, 
for example, in a higher plant, including members of the dicotyledonaceae and 
monocotyledonaccae. For example, polynucleotide sequences encoding therapeutic 

15 (e.g., for veterinary or medical uses) or immunogenic (e.g., for vaccination) 

polypeptides can be expressed using transformed plant hosts, for example, duckweed, 
according to the present invention. 

The use of the term "polynucleotide" is not intended to limit the present 
invention to polynucleotides comprising DNA. Those of ordinary skill in the art will 

20 recognize that polynucleotides can comprise ribonucleotides and combinations of 

ribonucleotides and deoxyribonucleotides. Such deoxyribonucleotides and 
ribonucleotides include both naturally occurring molecules and synthetic analogues. 
The polynucleotides of the invention also encompass all forms of sequences 
including, but not limited to, single-stranded forms, double-stranded forms, hairpins, 

25 stem-and-loop structures, and the like. 

The terms "inhibit," "inhibition," and "inhibiting" as used herein refer to any 
decrease in the expression or function of a target gene product, including any relative 
decrement in expression or function up to and including complete abrogation of 
expression or function of the target gene product. The term "expression" as used 

30 herein in the context of a gene product refers to the biosynthesis of that gene product, 

including the transcription and/or translation and/or assembly of the gene product. 
Inhibition of expression or function of a target gene product (i.e., a gene product of 
interest) can be in the context of a comparison between any two plants, for example. 
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expression or function of a target gene product in a genetically altered plant versus the 
expression or function of that target gene product in a corresponding wild-type plant. 
Alternatively, inhibition of expression or function of the target gene product can be in 
the context of a comparison between plant cells, organelles, organs, tissues, or plant 
5 parts within the same plant or between plants, and includes comparisons between 

developmental or temporal stages within the same plant or between plants. Any 
method or composition that down-regulates expression of a target gene product, either 
at the level of transcription or translation, or down-regulates functional activity of the 
target gene product can be used to achieve inhibition of expression or function of the 

1 0 target gene product. 

The term "inhibitory sequence" encompasses any polynucleotide or 
polypeptide sequence that is capable of inhibiting the expression of a target gene 
product, for example, at the level of transcription or translation, or which is capable of 
inhibiting the function of a target gene product. Examples of inhibitory sequences 

IS include, but are not limited to, full-length polynucleotide or polypeptide sequences, 

truncated polynucleotide or polypeptide sequences, fragments of polynucleotide or 
polypeptide sequences, variants of polynucleotide or polypeptide sequences, sense- 
oriented nucleotide sequences, antisense-oriented nucleotide sequences, the 
complement of a sense- or antisense-oriented nucleotide sequence, inverted regions of 

20 nucleotide sequences, hairpins of nucleotide sequences, double-stranded nucleotide 

sequences, single-stranded nucleotide sequences, combinations thereof, and the like. 
The term "polynucleotide sequence" includes sequences of RN A, UNA, chemically 
modified nucleic acids, nucleic acid analogs, combinations thereof, and the like. 

It is recognized that inhibitory polynucleotides include nucleotide sequences 

25 that directiy (i.e., do not require transcription) or indirectly (i.e., require transcription 

or transcription and translation) inhibit expression of a target gene product. For 
example, an inhibitory polynucleotide can comprise a nucleotide sequence that is a 
chemically synthesized or in v/7ro-produced small interfering RNA (siRNA) or micro 
RNA (miRNA) that, when introduced into a plant cell, tissue, or organ, would 

30 directiy, though transientiy, silence expression of the target gene product of interest. 

Alternatively, an inhibitory polynucleotide can comprise a nucleotide sequence that 
encodes an inhibitory nucleotide molecule that is designed to silence expression of the 
gene product of interest, such as sense-orientation RNA, antisense RNA, double- 
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Stranded RNA (dsRNA), hairpin RNA (hpRNA), intron-containing hpRNA, catalytic 
RNA, miRNA, and the like. In yet other embodiments, the inhibitory polynucleotide 
can comprise a nucleotide sequence that encodes a mRNA, the translation of which 
yields a polypeptide that inhibits expression or function of the target gene product of 
S interest. In this manner, where the inhibitory polynucleotide comprises a nucleotide 

sequence that encodes an inhibitory nucleotide molecule or a mRNA for a 
polypeptide, the encoding sequence is operably linked to a promoter that drives 
expression in a plant cell so that the encoded inhibitory nucleotide molecule or 
mRNA can be expressed. 

10 Inhibitory sequences are designated herein by the name of the target gene 

product. Thus, for example, an "al,3-fucosyltransferase (FucT) inhibitory sequence" 
would refer to an inhibitory sequence that is capable of inhibiting the expression of a 
FucT, for example, at the level of transcription and/or translation, or which is capable 
of inhibiting the function of a FucT. Similarly, a "p 1 ,2-xy losyltransferase (XylT) 

15 inhibitory sequence" would refer to an inhibitory sequence that is capable of 

inhibiting the expression of a XylT, at the level of transcription and/or translation, or 
which is capable of inhibiting the fiinction of a XylT. When the phrase "capable of 
inhibiting" is used in the context of a polynucleotide inhibitory sequence, it is 
intended to mean that the inhibitory sequence itself exerts the inhibitory effect; or, 

20 where the inhibitory sequence encodes an inhibitory nucleotide molecule (for 

example, hairpin RNA, miRNA, or double-stranded RNA polynucleotides), or 
encodes an inhibitory polypeptide (i.e., a polypeptide that inhibits expression or 
function of the target gene product), following its transcription (for example, in the 
case of an inhibitory sequence encoding a hairpin RNA, miRNA, or double-stranded 

25 RNA polynucleotide) or its transcription and translation (in the case of an inhibitory 

sequence encoding an inhibitory polypeptide), the transcribed or translated product, 
respectively, exerts the inhibitory effect on the target gene product (i.e., inhibits 
expression or function of the target gene product). 

The term "introducing" in the context of a polynucleotide, for example, a 

30 nucleotide construct of interest, is intended to mean presenting to the plant the 

polynucleotide in such a manner that the polynucleotide gains access to the interior of 
a cell of the plant. Where more than one polynucleotide is to be introduced, these 
polynucleotides can be assembled as part of a single nucleotide construct, or as 
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separate nucleotide constructs, and can be located on the same or different * 
transformation vectors. Accordingly, these polynucleotides can be introduced into the 
host cell of interest in a single transformation event, in separate transformation events, 
or, for example, in plants, as part of a breeding protocol. The methods of the 

5 invention do not depend on a particular method for introducing one or more 

polynucleotides into a plant, only that the polynucleotide(s) gains access to the 
interior of at least one cell of the plant. Methods for introducing polynucleotides into 
plants are known in the art including, but not limited to, transient transformation 
methods, stable transformation methods, and virus-mediated methods. 

10 'Transient transformation" in the context of a polynucleotide is intended to 

mean that a polynucleotide is introduced into the plant and does not integrate into the 
genome of the plant. 

By "stably introducing" or "stably introduced" in the context of a 
polynucleotide introduced into a plant is intended the introduced polynucleotide is 

15 stably incorporated into the plant genome, and thus the plant is stably transformed 

with the polynucleotide. 

"Stable transformation" or "stably transformed" is intended to mean that a 
polynucleotide, for example, a nucleotide construct described herein, introduced into 
a plant integrates into the genome of the plant and is capable of being inherited by the 

20 progeny thereof, more particularly, by the progeny of multiple successive generations. 

In some embodiments, successive generations include progeny produced vegetatively 
(i.e., asexual reproduction), for example, with clonal propagation. In other 
embodiments, successive generations include progeny produced via sexual 
reproduction. A higher plant host that is "stably transformed" with at least one 

25 nucleotide construct that is capable of inhibiting expression of a FucT and/or XylT as 

described herein refers to a higher plant host that has the nucleotide construct(s) 
integrated into its genome, and is capable producing progeny, either via asexual or 
sexual reproduction, that also comprise the inhibitory nucleotide construct(s) stably 
integrated into their genome, and hence the progeny will also exhibit the desired 

30 phenotype of having an altered A^-glycosylation pattern characterized by a reduction 

in the attachment of al,3-fiicose and/or p 1 ,2-xylose residues to the A^-glycans of 
homologous and heterologous glycoproteins produced therein, 
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As used herein, the term "plant" includes reference to whole plants, plant 
organs (e.g., leaves, stems, roots, etc.), seeds, plant cells, and progeny of same. Parts 
of transgenic plants are to be understood within the scope of the invention to 
comprise, for example, plant cells, protoplasts, tissues, callus, embryos as well as 
5 flowers, ovules, stems, fruits, leaves, roots, root tips, and the like originating in 

transgenic plants or their progeny previously transformed with a DNA molecule of the 
invention and therefore consisting at least in part of transgenic cells. As used herein, 
the term "plant cell" includes, without limitation, cells of seeds, embryos, 
meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, 

10 pollen, and microspores. 

The class of plants that can be used in the methods of the invention is 
generally as broad as the class of higher plants amenable to transformation 
techniques, including both monocotyledonous (monocot) and dicotyledonous (dicot) 
plants. Examples of dicots include, but are not limited to, legumes including 

15 soybeans and alfalfa, tobacco, potatoes, tomatoes, and the like. Examples of 

monocots include, but are not limited to, maize, rice, oats, barley, wheat, members of 
the duckweed family, grasses, and the like. "Lower-order plants" refers to non- 
flowering plants including fems, horsetails, club mosses, mosses, liverworts, 
homworts, algae, for example, red, brown, and green algae, gametophytes, 

20 sporophytes of pteridophytes, and the like. In some embodiments, the plant of interest 

is a member of the duckweed family of plants. 

The term "duckweed" refers to members of the family Lemnaceae. This 
family currently is divided into five genera and 38 species of duckweed as follows: 
genus Lemna (L aequinoctialis, L disperma, L. ecuadoriensis, L. gibba, L.japonica, 

25 L. minor, L. miniscula, L obscura, L. perpusilla, L. tenera, L. trisulca, L turionifera, 

L. valdiviana); genus Spirodela (S. intermedia, S. polyrrhiza, S. punctata)', genus 
Wolffia {Wa. angusta, Wa. arrhiza, Wa. australina, Wa. borealis, Wa. brasiliensis, 
Wa. Columbiana, Wa. elongata, Wa. globosa, Wa. microscopica, Wa. neglecta); genus 
Wolfiella (WL caudata, Wl. denticulata, Wl. gladiata, Wl. hyalina, Wl. lingulata, WI. 

30 repunda, Wl. rotunda, and Wl. neotropica) and genus Landoltia (L punctata). Any 

other genera or species of Lemnaceae, if they exist, are also aspects of the present 
invention. Lemna species can be classified using the taxonomic scheme described by 
Landolt (1986) Biosystematic Investigation on the Family of Duckweeds: The family 
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of Lemnaceae—A Monograph Study (Geobatanischen Institut ETH, Stiftung Rubel, 
Zurich). 

The term "duckweed nodule" as used herein refers to duckweed tissue 
comprising duckweed cells where at least about 50%, 55%, 60%, 65%, 70%, 75%, 
5 80%, 85%, 90%, 95%, or 100% of the cells are differentiated cells. A "differentiated 

cell," as used herein, is a cell with at least one phenotypic characteristic (e.g., a 
distinctive cell morphology or the expression of a marker nucleic acid or protein) that 
distinguishes it from undifferentiated cells or from cells found in other tissue types. 
The differentiated cells of the duckweed nodule culture described herein form a tiled 
10 smooth surface of interconnected cells fused at their adjacent cell walls, with nodules 

that have begun to organize into frond primordium scattered throughout the tissue. 
The surface of the tissue of the nodule culture has epidermal cells connected to each 
other via plasmadesmata. Members of the duckweed family reproduce by clonal 
propagation, and thus are representative of plants that clonally propagate. 
1 5 "Duckweed-preferred codons" as used herein refers to codons that have a 

frequency of codon usage in duckweed of greater than 17%. 

"Lemwfl-preferred codons" as used herein refers to codons that have a 
frequency of codon usage in the genus Lemna of greater than 17%. 

''Lemna gibba-prQkrtQd codons" as used herein refers to codons that have a 
20 frequency of codon usage in Lemna gihha of greater than 17% where the frequency of 

codon usage in Lemna gibba was obtained from the Codon Usage Database (GenBank 
Release 1 13,0; at htip://www.kazusa.or.iD/codon/cgibin/'showcodon.cgi7species = 
Lemna+gibba+[gbpln]). 

"Translation initiation codon" refers to the codon that initiates the translation 
25 of the mRNA transcribed from the nucleotide sequence of interest. 

"Translation initiation context nucleotide sequence" as used herein refers to 
the identity of the three nucleotides directly 5' of die translation initiation codon. 

"Secretion" as used herein refers to translocation of a polypeptide across both 
the plasma membrane and the cell wall of a host plant cell. 
30 "Operably linked" as used herein in reference to nucleotide sequences refers to 

multiple nucleotide sequences that are placed in a functional relationship with each 
other. Generally, operably linked DNA sequences are contiguous and, where 
necessary to join two protein coding regions, in reading frame. 
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Isolated Polvnuclcorides and Polvpeptides 

The present invention provides isolated polynucleotides and polypeptides that 
are involved in further modification of plant TV-linked glycans (also referred to as 'W- 
5 glycans"). particularly an a 1 ,3-fiicosyltransferase (FucT) and P 1 ,2-xylosyltransferase 

(XylT) identified in Lemna minor, a member of the duckweed family, and variants 
and fragments of these polynucleotides and polypeptides. Inhibition of the expression 
of one or both of these proteins, or biologically active variants thereof, in a plant that 
expresses these proteins beneficially yields an A^-glycosylation pattern that has a 

10 reduction in the attachment of al ,3-fucose and pi, 2 -xylose residues to glycoprotein 

A/^glycans. In some embodiments of the invention, the methods disclosed herein 
provide for complete inhibition of expression of FucT and XylT, yielding an N- 
glycoslyation pattern of glycoproteins produced within a plant wherein the A^-linked 
glycans are devoid of aI,3-fucose and pi,2-xylose residues. 

1 5 The full-length cDNA sequence, including 5'- and 3'-UTR, for L. minor alpha 

1-3 fucosyltransferase (FucT) is set forth in Figure 1; see also SEQ ID NO: 1 (open 
reading frame set forth in SEQ ID N0:2). The predicted amino acid sequence 
encoded thereby is set forth in SEQ ID N0:3. At least two isoforms of the L minor 
FucT gene have been identified; the homology between the isoforms is about 90%. 

20 The encoded protein shares some similarity with other FucTs from other higher 

plants. See Figure 2. For example, the L minor FucT sequence shares approximately 
50.1% sequence identity with the Arabidopsis thaliana FucT shown in Figure 2. 

The full-length cDNA sequence, including 5'- and 3'-UTR, fori, minor 
xylosyltransferase (XylT) (isoform #1) is set forth in Figure 3; see also SEQ ID NO:4 

25 (ORF set forth in SEQ ID N0:5). The predicted amino acid sequence encoded 

thereby is set forth in SEQ ID N0:6. At least two isoforms of the L minor XylT gene 
have been identified; the homology between the isoforms is about 90%. The encoded 
protein shares some similarity with other XylTs from other higher plants. See Figure 
4. For example, the L minor XylT shares approximately 56.4% sequence identity 

30 with the Arabidopsis thaliana XylT shown in Figure 4. A partial-length cDNA 

sequence, including 3*-UTR, for L minor ^\'2 xylosyltransferase (XylT) (isoform #2) 
is set forth in Figure 3 1; see also SEQ ID NO: 19 (ORF set forth in SEQ ID NO:20). 
The predicted amino acid sequence encoded thereby is set forth in SEQ ID N0:21. 
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The partial-length XylT isoform #2 shares high sequence identity with the 
corresponding region of the full-length XylT isoform #1, as can be seen from the 
alignment shown in Figure 32. 

The invention encompasses isolated or substantially purified polynucleotide or 
5 protein compositions. An "isolated" or "purified" polynucleotide or protein, or 

biologically active portion thereof, is substantially or essentially free from 
components that normally accompany or interact with the polynucleotide or protein as 
found in its naturally occurring environment Thus, an isolated or purified 
polynucleotide or protein is substantially free of other cellular material, or culture 

10 medium when produced by recombinant techniques, or substantially free of chemical 

precursors or other chemicals when chemically synthesized. Optimally, an "isolated" 
polynucleotide is free of sequences (optimally protein encoding sequences) that 
naturally flank the polynucleotide (i.e., sequences located at the 5' and 3' ends of the 
polynucleotide) in the genomic DNA of the organism from which the polynucleotide 

15 is derived. For example, in various embodiments, the isolated polynucleotide can 

contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, O.S kb, or 0.1 kb of nucleotide 
sequence that naturally flank the polynucleotide in genomic DNA of the cell from 
which the polynucleotide is derived. A protein that is substantially free of cellular 
material includes preparations of protein having less than about 30%, 20%, 10%, 5%, 

20 or 1% (by dry weight) of contaminating protein. When the protein of the invention or 

biologically active portion thereof is recombinantly produced, optimally culture 
medium represents less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of 
chemical precursors or non-protein-of-interest chemicals. 

The coding sequence for the L, minor FucT gene is set forth as nucleotides (nt) 

25 243- 1 7 1 5 of SEQ ID NO: 1 and as SEQ ID N0:2, and the amino acid sequence for the 

encoded FucT polypeptide is set forth in SEQ ID N0:3. The coding sequence for the 
L minor XylT isoform #1 gene is set forth as nucleotides 63-1592 of SEQ ID N0:4 
and as SEQ ID NO: 5, and the amino acid sequence for the encoded XylT polypeptide 
is set forth in SEQ ID N0:6. The coding sequence for the partial-length L minor 

30 XylT isoform #2 gene is set forth as nucleotides 1 - 1 276 of SEQ ID NO: 1 9 and as 

SEQ ID NO:20, and the amino acid sequence for the encoded partial-length XylT 
polypeptide is set forth in SEQ ID N0:21. 
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In particular, the present invention provides for isolated polynucleotides 
comprising nucleotide sequences encoding the amino acid sequences shown in SEQ 
ID N0S:3, 6, and 21. Further provided are polypeptides having an amino acid 
sequence encoded by a polynucleotide described herein, for example those set forth in 
5 SEQ ID NOS: 1 , 2, 4, 5, 19, and 20, and fragments and variants thereof. Nucleic acid 

molecules comprising the complements of these nucleotide sequences are also 
provided. It is recognized that the coding sequence for the FucT and/or XylT gene 
can be expressed in a plant for overexpression of the encoded FucT and/or XylT. 
However, for purposes of suppressing or inhibiting the expression of these proteins, 

10 the respective nucleotide sequences of SEQ ID NO: 1 , 2, 4, 5, 1 9, and 20 will be used 

to design constructs for suppression of expression of the respective FucT and/or XylT 
protein. Thus, polynucleotides, in the context of suppressing the FucT protein refers 
to the FucT coding sequences and to polynucleotides that when expressed suppress or 
inhibit expression of the FucT gene, for example, via direct or indirect suppression as 

IS noted herein below. Similarly, polynucleotides, in the context of suppressing or 

inhibiting the XylT protein refers to the XylT coding sequences and to 
polynucleotides that when expressed suppress or inhibit expression of the XylT gene, 
for example, via direct or indirect suppression as noted herein below. 

Fragments and variants of the disclosed polynucleotides and proteins encoded 

20 thereby are also encompassed by the present invention. By "fragment" is intended a 

portion of the FucT or XylT polynucleotide or a portion of the FucT or XylT amino 
acid sequence encoded thereby. Fragments of a polynucleotide may encode protein 
fragments that retain the biological activity of the native protein and hence have FucT 
activity or XylT activity as noted elsewhere herein. Alternatively, fragments of a 

25 polynucleotide that are useful as hybridization probes generally do not encode 

fragment proteins retaining biological activity. Fragments of a FucT or XylT 
polynucleotide can also be used to design inhibitory sequences for suppression of 
expression of the FucT and/or XylT polypeptide. Thus, for example, fragments of a 
nucleotide sequence may range from at least about 15 nucleotides, 20 nucleotides, 

30 about 50 nucleotides, about 100 nucleotides, about 150 nucleotides, about 200 

nucleotides, about 250 nucleotides, about 300 nucleotides, about 350 nucleotides, 
about 400 nucleotides, about 450 nucleotides, about 500 nucleotides, about 550 
nucleotides, about 600 nucleotides, about 650 nucleotides, about 700 nucleotides. 
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about 750 nucleotides, about 800 nucleotides, and up to the ftill-length polynucleotide 
encoding the proteins of the invention. 

A fragment of a FucT polynucleotide that encodes a biologically active 
portion of a FucT protein of the invention will encode at least 15, 25, 30, 50, 100, 

5 150, 200. 250, 300, 350, 400, 450, 475, 500 contiguous amino acids, or up to the total 

number of amino acids present in a full-length FucT protein of the invention (for 
example, 509 amino acids for SEQ ID N0:3). A fragment of a XylT polynucleotide 
that encodes a biologically active portion of a full-length XylT protein of the 
invention will encode at least 15, 25, 30, 50, 100, 150, 200, 250, 300, 350, 400, 450, 

10 475 contiguous amino acids, or up to the total number of amino acids present in a full- 

length XylT protein of the invention (for example, 490 amino acids for SEQ ID 
N0:3). A fragment of a XylT polynucleotide that encodes a biologically active 
portion of a partial-length XylT protein of the invention will encode at least 15, 25, 
30, 50, 100, 150, 200, 250, 300, 350, 400 contiguous amino acids, or up to the total 

1 5 number of amino acids present in a partial-length XylT protein of the invention (for 

example, 490 amino acids for SEQ ID N0:21) 

Thus, a fragment of a FucT or XylT polynucleotide may encode a biologically 
active portion of a FucT or XylT protein, respectively, or it may be a fragment that 
can be used as a hybridization probe or PGR primer, or used to design inhibitory 

20 sequences for suppression, using methods disclosed below. A biologically active 

portion of a FucT or XylT protein can be prepared by isolating a portion of one of the 
FucT or XylT polynucleotides of the invention, respectively, expressing the encoded 
portion of the FucT or XylT protein (e.g., by recombinant expression in vitro), and 
assessing the activity of the encoded portion of the FucT or XylT polypeptide. 

25 Polynucleotides that are fragments of an FucT or XylT nucleotide sequence comprise 

at least 15, 20, 50, 75, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 
700, 800, 900, 1,000, 1,100, 1,200, 1,300, 1,400, or 1450 contiguous nucleotides, or 
up to the number of nucleotides present in a FucT or XylT polynucleotide disclosed 
herein (for example, 1865, 1473, 1860, 1530, 1282, or 1275 nucleotides for SEQ ID 

30 NOS: 1 , 2, 4, 5, 1 9, and 20, respectively). 

"Variants" is intended to mean substantially similar sequences. For 
polynucleotides, a variant comprises a deletion and/or addition of one or more 
nucleotides at one or more sites within the native polynucleotide and/or a substitution 
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of one or more nucleotides at one or more sites in the native polynucleotide. As used 
herein, a "native" polynucleotide or polypeptide comprises a naturally occurring 
nucleotide sequence or amino acid sequence, respectively. For polynucleotides, 
conservative variants include those sequences that, because of the degeneracy of the 

5 genetic code, encode the amino acid sequence of one of the FucT or XylT 

polypeptides of the invention. Naturally occurring allelic variants such as these can 
be identified with the use of well-known molecular biology techniques, as, for 
example, with polymerase chain reaction (PCR) and hybridization techniques as 
outlined below. Variant polynucleotides also include synthetically derived 

10 polynucleotides, such as those generated, for example, by using site-directed 

mutagenesis but which still encode a FucT or XylT protein of the invention. 
Generally, variants of a particular polynucleotide of the invention (for example, SEQ 
ID N0:1, SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 19. or SEQ ID 
NO:20, fragments thereof, and complements of these sequences) will have at least 

1 5 about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%. 80%, 85%. 90%. 91%. 92%, 

93%. 94%, 95%, 96%, 97%. 98%, 99% or more sequence identity to that particular 
polynucleotide as determined by sequence alignment programs and parameters 
described elsewhere herein. 

Variants of a particular polynucleotide of the invention (i.e., the reference 

20 polynucleotide) can also be evaluated by comparison of the percent sequence identity 

between the polypeptide encoded by a variant polynucleotide and the polypeptide 
encoded by the reference polynucleotide. Thus, for example, an isolated 
polynucleotide that encodes a polypeptide with a given percent sequence identity to 
the FucT or XylT polypeptide of SEQ ID N0:3, SEQ ID N0:6, or SEQ ID N0:2 1 , 

25 respectively, is disclosed. Percent sequence identity between any two polypeptides 

can be calculated using sequence alignment programs and parameters described 
elsewhere herein. Where any given pair of polynucleotides of the invention is 
evaluated by comparison of the percent sequence identity shared by the two 
polypeptides they encode, the percent sequence identity between the two encoded 

30 polypeptides is at least about 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%. 

85%, 90%, 91%, 92%, 93%. 94%, 95%, 96%, 97%, 98%, 99% or more sequence 
identity. 
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"Variant" protein is intended to mean a protein derived from the native protein 
by deletion or addition of one or more amino acids at one or more sites in the native 
protein and/or substitution of one or more amino acids at one or more sites in the 
native protein. Variant proteins encompassed by the present invention are 
5 biologically active, that is they continue to possess the desired biological activity of 

the native protein, that is, the enzymatic activity of attaching the al ,3 -linked fucose 
residue (activity of FucT) or pi,2-Unked xylose residue (activity of XylT) to 
glycoprotein A/-glycans in plants as described herein. Such variants may result from, 
for example, genetic polymorphism or from human manipulation. Biologically active 

10 variants of a native FucT or XylT protein of the invention will have at least about 

40%. 45%, 50%, 55%. 60%, 65%, 70%. 75%. 80%. 85%, 90%, 91%. 92%. 93%. 
94%, 95%, 96%, 97%, 98%, 99% or more sequence identity to the amino acid 
sequence for the native protein as determined by sequence alignment programs and 
parameters described elsewhere herein. A biologically active variant of a protein of 

15 the invention may differ from that protein by as few as 1-15 amino acid residues, as 

few as 1-10, such as 6-10, as few as 5, as few as 4, 3, 2, or even 1 amino acid residue. 

The proteins of the invention may be altered in various ways including amino 
acid substitutions, deletions, truncations, and insertions. Methods for such 
manipulations are generally known in the art. For example, amino acid sequence 

20 variants and fragments of the FucT and XylT proteins can be prepared by mutations in 

the DNA. Methods for mutagenesis and polynucleotide alterations are well known in 
the art. See, for example, Kunkel (1985) Proc, Natl Acad. ScL USA 82:488-492; 
Kunkel et al, (1987) Methods in EnzymoL 154:367-382; U.S. Patent No. 4.873,192; 
Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan 

25 Publishing Company, New York) and the references cited therein. Guidance as to 

appropriate amino acid substitutions that do not affect biological activity of the 
protein of interest may be found in the model of Dayhoff et al (1978) Atlas of Protein 
Sequence and Structure (Natl. Biomed. Res. Foxmd., Washington, D.C.), herein 
incorporated by reference. Conservative substitutions, such as exchanging one amino 

30 acid with another having similar properties, may be optimal. 

Thus, the polynucleotides of the invention include both the naturally occurring 
FucT and XylT sequences as well as mutant forms. Likewise, the proteins of the 
invention encompass both naturally occurring FucT and XylT proteins as well as 
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variations and modified forms thereof. Such variants will continue to possess the 
desired activity. Thus, where expression of a functional protein is desired, the 
expressed protein will possess the desired FucT or XylT activity, i.e., the enzymatic 
activity of attaching the a 1,3 -linked fucose residue (activity of FucT) or pi,2-linked 
5 xylose residue (activity of XylT) to glycoprotein A^-glycans in plants as described 

herein. Where the objective is inhibition of expression or function of the FucT and/or 
XylT polypeptide, the desired activity of the variant polynucleotide or polypeptide is 
one of inhibiting expression or function of the respective FucT and/or XylT 
polypeptide. Obviously, where expression of a functional FucT or XylT variant is 

10 desired, the mutations that will be made in the DNA encoding the variant must not 

place the sequence out of reading frame and optimally will not create complementary 
regions that could produce secondary mRNA structure. See, EP Patent Application 
Publication No. 75,444. 

Where a functional protein is desired, the deletions, insertions, and 

1 5 substitutions of the protein sequences encompassed herein are not expected to 

produce radical changes in the characteristics of the protein. However, when it is 
difficult to predict the exact effect of the substitution, deletion, or insertion in advance 
of doing so, one skilled in the art will appreciate that the effect will be evaluated by 
routine screening assays, including the assays for monitoring FucT and XylT activity 

20 described herein below in the Experimental section. 

Variant polynucleotides and proteins also encompass sequences and proteins 
derived from a mutagenic and recombinogenic procedure such as DNA shuffling. 
With such a procedure, one or more different FucT or XylT coding sequences can be 
manipulated to create a new FucT or XylT protein possessing the desired properties. 

25 In this manner, libraries of recombinant polynucleotides are generated from a 

population of related sequence polynucleotides comprising sequence regions that have 
substantial sequence identity and can be homologously recombined in vitro or in vivo. 
For example, using this approach, sequence motifs encoding a domain of interest may 
be shuffled between the FucT or XylT gene of the invention and other known FucT or 

30 XylT genes, respectively, to obtain a new gene coding for a protein with an improved 

property of interest. Strategies for such DNA shuffling are known in the art. See, for 
example. Stemmer (1994) Proc. Natl Acad. Sci. USA 91:10747-10751; Stemmer 
(1994) Nature 370:389-391; Crameri et al (1997) Nature Biotech, 15:436-438; 
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Moore et al (1997) J. Mol Biol 272:336-347; Zhang et al (1997) Proc, Natl Acad, 
Set USA 94:4504-4509; Crameri et al (1998) Nature 391:288-291; and U.S. Patent 
Nos. 5,605,793 and 5,837,458. 

The comparison of sequences and determination of percent identity and 
5 percent similarity between two sequences can be accomplished using a mathematical 

algorithm. In a preferred embodiment, the percent identity between two amino acid 
sequences is determined using the Needleman and Wunsch (1970) J. Mol Biol 
^5:444-453 algorithm, which is incorporated into the GAP program in the GCG 
software package (available at www.accelrys.com), using either a BLOSSUM62 

10 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length 

weight of 1, 2, 3, 4, 5, or 6, In yet another preferred embodiment, the percent identity 
between two nucleotide sequences is determined using the GAP program in the GCG 
software package, using a BLOSUM62 scoring matrix (see Henikoff al (1989) 
Proc. Nail Acad. Scl USA 89:10915) and a gap weight of 40, 50, 60, 70, or 80 and a 

15 length weight of 1, 2, 3, 4, 5, or 6. A particularly preferred set of parameters (and the 

one that should be used if the practitioner is imcertain about what parameters should 
be appHed to determine if a molecule is within a sequence identity limitation of the 
invention) is using a BLOSUM62 scoring matrix with a gap weight of 60 and a length 
weight of 3). 

20 The percent identity between two amino acid or nucleotide sequences can also 

be determined using the algorithm of Meyers and Miller ( 1 989) CABIOS 4:\\-\l 
which has been incorporated into the ALIGN program (version 2.0), using a PAM120 
weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

An alternative indication that two nucleic acid molecules are closely related is 

25 that the two molecules hybridize to each other under stringent conditions. Stringent 

conditions are sequence-dependent and are different under different environmental 
parameters. Generally, stringent conditions are selected to be about 5°C to 20°C 
lower than the thermal melting point (Tm) for the specific sequence at a defined ionic 
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at 

30 which 50% of the target sequence hybridizes to a perfectly matched probe. 

Conditions for nucleic acid hybridization and calculation of stringencies can be found, 
for example, in Sambrook et al (2001) Molecular Cloning: A Laboratory Manual 
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY) and Tijssen (1993) 
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Hybridization With Nucleic Acid Probes^ Part J: Theory and Nucleic Acid 
Preparation (Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Ltd,, NY, NY). 

For purposes of the present invention, "stringent conditions" encompass 

5 conditions under which hybridization will only occur if there is less than 25% 

mismatch between the hybridization molecule and the target sequence. "Stringent 
conditions" may be broken down into particular levels of stringency for more precise 
definition. Thus, as used herein, "moderate stringency" conditions are those under 
which molecules with more than 25% sequence mismatch will not hybridize; 

10 conditions of "medium stringency" are those under which molecules with more than 

15% mismatch will not hybridize, and conditions of "high stringency" are those under 
which sequences with more than 10% mismatch will not hybridize. Conditions of 
"very high stringency" are those under which sequences with more than 6% mismatch 
will not hybridize. 

1 5 The FucT and XylT polynucleotides of the invention can be used as probes for 

the isolation of corresponding homologous sequences in other organisms, more 
particularly in other plant species. In this manner, methods such as PCR, 
hybridization, and the like can be used to identify such sequences based on their 
sequence homology to the sequences of the invention. See, for example, Sambrook et 

20 aL (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor 

Laboratory Press, Plainview, New York) and Innis et aL (1 990), PCR Protocols: A 
Guide to Methods and Applications (Academic Press, New York). Polynucleotide 
sequences isolated based on their sequence identity to the entire FucT or XylT 
polynucleotides of the invention (i.e., SEQ ID NOS: 1 and 2 for FucT; SEQ ID N0S:4 

25 and 5 for XylT isoform #1 of SEQ ID N0:6; and SEQ ID NOS: 1 9 and 20 for XylT 

isoform #2 of SEQ ID NO: 21 ) or to fragments and variants thereof are encompassed 
by the present invention. 

In a PCR method, oligonucleotides primers can be designed for use in PCR 
reactions for amplification of corresponding DNA sequences from cDNA or genomic 

30 DNA extracted from any organism of interest. Known methods of PCR include, but 

are not limited to, methods using paired primers, nested primers, single specific 
primers, degenerate primers, gene-specific primers, vector-specific primers, partially- 
mismatched primers, and the like. Methods for designing PCR primers and PCR 
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cloning are generally known in the art and are disclosed in Sambrook et al, (1989) 
Molecular Cloning: A Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory 
Press, Plain view. New York). See also Innis et al, eds. (1990) PCR Protocols: A 
Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand, 

5 eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds. 

(1999) PCR Methods Manual (Academic Press, New York). 

In a hybridization method, all or part of a known nucleotide sequence can be 
used as a probe that selectively hybridizes to other corresponding polynucleotides 
present in a population of cloned genomic DNA fragments or cDNA fragments (i.e., 

10 cDNA or genomic libraries) from another organism of interest. The so-called 

hybridization probes may be genomic DNA fragments, cDNA fragments, RNA 
fragments, or other oligonucleotides, and may be labeled with a detectable group such 
as ^^P, or any other detectable marker. Probes for hybridization can be made by 
labeling synthetic oligonucleotides based on the nucleotide sequence of interest, for 

15 example, the FucT or XylT polynucleotides of the invention. Degenerate primers 

designed on the basis of conserved nucleotides or amino acid residues in the known 
nucleotide or encoded amino acid sequence can additionally be used. Methods for 
construction of cDNA and genomic libraries, and for preparing hybridization probes, 
are generally known in the art and are disclosed in Sambrook et al. (1989) Molecular 

20 Cloning: A Laboratory Manual (2d ed.. Cold Spring Harbor Laboratory Press, 

Plainview, New York), herein incorporated by reference. 

For example, all or part of the specific known FucT or XylT polynucleotide 

4 

sequence may be used as a probe that selectively hybridizes to other FucT or XylT 
nucleotide and messenger RNAs. To achieve specific hybridization under a variety of 

25 conditions, such probes include sequences that are unique and are preferably at least 

about 10 nucleotides in length, and more optimally at least about 20 nucleotides in 
length. This technique may be used to isolate other corresponding FucT or XylT 
nucleotide sequences from a desired organism or as a diagnostic assay to determine 
the presence of a FucT or XylT coding sequences in an organism. Hybridization 

30 techniques include hybridization screening of plated DNA libraries (either plaques or 

colonies; see, for example, Innis et al., eds. (1990) PCR Protocols: A Guide to 
Methods and Applications (Academic Press, New York)), 
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Thus, in addition to the native FucT and XylT polynucleotides and fragments 
and variants thereof, the isolated polynucleotides of the invention also encompass 
homologous DNA sequences identified and isolated from other organisms by 
hybridization with entire or partial sequences obtained from the FucT or XylT 
5 polynucleotides of the invention or variants thereof. Conditions that will permit other 

DNA sequences to hybridize to the DNA sequences disclosed herein can be 
determined in accordance with techniques generally known in the art. For example, 
hybridization of such sequences may be carried out under various conditions of 
moderate, medium, high, or very high stringency as noted herein above. 

10 

Methods of the Invention 

The present invention is directed to methods for altering protein glycosylation 
patterns in higher plants, particularly in higher plants that serve as hosts for 
production of recombinant proteins, particularly recombinant mammaliara proteins of 

1 5 pharmaceutical interest. The methods find use in producing higher plants that are 

capable of producing recombinant proteins having an A/^glycosylation pattern that 
more closely resembles that found in mammals. Compositions of the invention 
include higher plants that are stably transformed to comprise an altered A^- 
glycosylation pattern of their endogenous (i.e., homologous) and recombinantly 

20 produced heterologous proteins. In some embodiments, the higher plants are 

transgenic plants that produce monoclonal antibodies (mAbs) to mammalian proteins 
that have enhanced ADCC activity relative to mAbs produced in a control plant that 
has not had the glycosylation machinery altered to reduce the plant-specific 
attachment of al,3-fucose residues to the N-glycans of homologous and heterologous 

25 glycoproteins produced therein. 

The methods of the invention target the suppression (i.e., inhibition) of the 
expression of one or both of the enzymes involved the production of complex 
glycoproteins in higher plants. Of particular interest is suppression of a 
fucosyltransferase or one or more isoforms thereof, suppression of a 

30 xylosyltransferase or one or more isoforms thereof, or suppression of expression of 

both of these proteins and one or more isoforms thereof. It is recognized that 
suppression of the fucosyltransferase and/or xylosyltransferase and one or more 
isoforms thereof can be accomplished transiently. Alternatively, by stably 
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suppressing the expression of the fucosyitransferase and/or xylosyltransferase, it is 
possible to produce transgenic higher plants that carry over from generation to 
generation, either asexually or sexually, the ability to produce glycoproteins having an 
A/^glycosylation pattern that more closely resembles that found in mammals, more 
5 particularly, in humans. This advantageously provides for the production of 

recombinant mammalian glycoproteins that have reduced attachment of the plant 
pi,2-linked xylose residue and/or al,3-linked fucosc residue to glycoprotein A^- 
glycans. 

Inhibition of the expression of one of advantageously both of these proteins in 

10 a plant, for example, a dicotyledonous or monocotyledonous plant, for example, a 

duckweed plant, can be carried out using any method known in the art. In this 
manner, a polynucleotide comprising an inhibitory sequence for FucT, XylT, or a 
combination thereof is introduced into the host cell of interest. For transient 
suppression, the FucT or XylT inhibitory sequence can be a chemically synthesized or 

15 in v//ro-produced small interfering RNA (siRNA) or micro RNA (miRNA) that, when 

introduced into the host cell, would directly, though transiently, inhibit FucT, XylT, 
or a combination thereof, by silencing expression of these target gene product(s). 

Alternatively, stable suppression of expression of FucT, XylT, or a 
combination thereof is desirable as noted herein above. Thus, in some embodiments, 

20 the activity of the FucT or the XylT polypeptide of the invention is reduced or 

eliminated by transforming a plant cell with an expression cassette that expresses a 
polynucleotide that inhibits the expression of the FucT or XylT, or both. The 
polynucleotide may inhibit the expression of the FucT or XylT, or both, directly, by 
preventing transcription or translation of the FucT or XylT messenger RNA, or 

25 indirectly, by encoding a polypeptide that inhibits the transcription or translation of a 

gene encoding the FucT or XylT, or both. Methods for inhibiting or eliminating the 
expression of a gene in a plant are well known in the art, and any such method may be 
used in the present invention to inhibit the expression of FucT or XylT, or both. 

Thus, in some embodiments, expression of the FucT and/or XylT protein can 

30 be inhibited by introducing into the plant a nucleotide construct, such as an expression 

cassette, comprising a sequence that encodes an inhibitory nucleotide molecule that is 
designed to silence expression of the FucT and/or XylT gene product of interest, such 
as sense-orientation RNA, antisense RNA, double-stranded RNA (dsRNA), hairpin 
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RNA (hpRNA), intron-containing hpRNA, catalytic RNA, miRNA, and the like. In 
other embodiments, the nucleotide construct, for example, an expression cassette, can 
comprise a sequence that encodes a mRNA, the translation of which yields a 
polypeptide of interest that inhibits expression or function of the FucT and/or XylT 
5 gene product of interest. Where the nucleotide construct comprises a sequence that 

encodes an inhibitory nucleotide molecule or a mRNA for a polypeptide of interest, 
the sequence is operably linked to a promoter that drives expression in a plant cell so 
that the encoded inhibitory nucleotide molecule or mRNA can be expressed. 

In accordance with the present invention, the expression of a FucT or XylT 

10 gene is inhibited if the protein level of the FucT or XylT is statistically lower than the 

protein level of the same FucT or XylT in a plant that has not been genetically 
modified or mutagenized to inhibit the expression of that FucT or XylT. In particular 
embodiments of the invention, the protein level of the FucT or XylT, or both, in a 
modified plant according to the invention is less than 95%, less than 90%, less than 

15 80%, less than 70%, less than 60%, less than 50%, less than 40%, less than 30%, less 

than 20%, less than 10%, or less than 5% of the protein level of the same FucT or 
XylT in a plant that is not a mutant or that has not been genetically modified to inhibit 
the expression of that FucT or XylT, or both the FucT or XylT. The expression level 
of the FucT or XylT, or both, may be measured directly, for example, by assaying for 

20 the level of FucT or XylT, or both, expressed in the plant cell or plant, or indirectly, 

for example, by observing the effect in a transgenic plant at the phenotypic level, i.e., 
by transgenic plant analysis, observed as a reduction, or even elimination, of the 
attachment of pi ,2-xylose and/or al ,3-fucose residues to the glycoprotein A^-glycans 
in the plant. 

25 In other embodiments of the invention, the activity of FucT or XylT, or both, 

is reduced or eliminated by transforming a plant cell with an expression cassette 
comprising a polynucleotide encoding a polypeptide that inhibits the activity of FucT 
or XylT, or both. The activity of a FucT or XylT is inhibited according to the present 
invention if the activity of the FucT or XylT is statistically lower than the activity of 

30 the same FucT or XylT in a plant that has not been genetically modified to inhibit the 

activity of that FucT or XylT. In particular embodiments of the invention, the activity 
of the FucT or XylT in a modified plant according to the invention is less than 95%, 
less than 90%. less than 80%, less than 70%, less than 60%, less than 50%. less than 
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40%, less than 30%, less than 20%, less than 10%, or less than 5% of the activity of 
the same FucT or XylT in a plant that has not been genetically modified to inhibit the 
expression of that FucT or XylT. The activity of a FucT or XylT is "eliminated" 
according to the invention when it is not detectable by the assay methods described 

5 elsewhere herein. 

In other embodiments, the activity of a FucT or XylT, or both, may be reduced 
or eliminated by disrupting the gene encoding the FucT or XylT, or both of these 
genes. The invention encompasses mutagenized plants, particularly plants that are 
members of the duckweed family, that carry mutations in a FucT or XylT gene, or 

10 mutations in both genes, where the mutations reduce expression of the FucT and/or 

XylT gene or inhibit the activity of the encoded FucT and/or XylT. 

The methods of the invention can involve any method or mechanism known in 
the art for reducing or eliminating the activity or level of FucT and/or XylT in the 
cells of a higher plant, including, but not limited to, antisense suppression, sense 

1 5 suppression, RNA interference, directed deletion or mutation, dominant-negative 

strategies, and the like. Thus, the methods and compositions disclosed herein are not 
limited to any mechanism or theory of action and include any method where 
expression or function of FucT and/or XylT is inhibited in the cells of the higher plant 
of interest, thereby altering the A^-glycosylation pattern of endogenous and 

20 heterologous glycoproteins produced in the plant. 

For example, in some embodiments, the FucT inhibitory sequence or the XylT 
inhibitory sequence (or both) is expressed in the sense orientation, wherein the sense- 
oriented transcripts cause cosuppression of the expression of one or both of these 
enzymes. Alternatively, the FucT and/or XylT inhibitory sequence (e.g., full-length 

25 sequence, truncated sequence, fragments of the sequence, combinations thereof, and 

the like) can be expressed in the antisense orientation and thus inhibit endogenous 
FucT and/or XlyT expression or function by antisense mechanisms. 

In yet other embodiments, the FucT and/or XylT inhibitory sequence or 
sequences are expressed as a hairpin RNA, which comprises both a sense sequence 

30 and an antisense sequence. In embodiments comprising a hairpin structure, the loop 

structure may comprise any suitable nucleotide sequence including for example 5' 
untranslated and/or translated regions of the gene to be suppressed, such as the S' 
UTR and/or translated region of the FucT polynucleotide of SEQ ID N0:1 or 2, or the 
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5' UTR and/or translated region of the XylT polynucleotide of SEQ ID N0:4, 5, 19, or 
20, and the like. In some embodiments, the FucT or XylT inhibitory sequence 
expressed as a haiipin is encoded by an inverted region of the FucT or XylT 
nucleotide sequence. In yet other embodiments, the FucT and/or XylT inhibitory 

5 sequences are expressed as double-stranded RNA, where one FucT and/or XylT 

inhibitory sequence is expressed in the sense orientation and another complementary 
sequence is expressed in the antisense orientation. Double-stranded RNA, hairpin 
structures, and combinations thereof comprising FucT nucleotide sequences, XylT 
nucleotide sequences, or combinations thereof may operate by RNA interference, 

10 cosupprcssion, antisense mechanism, any combination thereof, or by means of any 

other mechanism that causes inhibition of FucT and/or XylT expression or function. 

ThuvS, many methods may be used to reduce or eliminate the activity of a FucT 
or XylT, or both of these proteins, and any isoforms thereof. By "isoform" is 
intended a naturally occurring protein variant of the FucT or XylT protein of interest, 

1 5 where the variant is encoded by a different gene. Generally, isoforms of a particular 

FucT or XylT protein of interest are encoded by a nucleotide sequence having at least 
90% sequence identity to the nucleotide sequence encoding the FucT or XylT protein 
of interest. More than one method may be used to reduce or eliminate the activity of a 
single plant FucT or XylT, and isoforms thereof. Non-limiting examples of methods 

20 of reducing or eliminating the activity of a plant FucT or XylT are given below. 

Polynucleotide-Based Methods: 

In some embodiments of the present invention, a plant cell is transformed with 
25 an expression cassette that is capable of expressing a polynucleotide that inhibits the 

expression of FucT or XylT, or both. The term "expression" as used herein refers to 
the biosynthesis of a gene product, including the transcription and/or translation of the 
gene product. For example, for the purposes of the present invention, an expression 
cassette capable of expressing a polynucleotide that inhibits the expression of at least 
30 one FucT or XylT, or both, is an expression cassette capable of producing an RNA 

molecule that inhibits the transcription and/or translation of at least one FucT or XylT, 
or both. The "expression" or "production" of a protein or polypeptide from a DNA 
molecule refers to the transcription and translation of the coding sequence to produce 
the protein or polypeptide, while the "expression" or "production" of a protein or 
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polypeptide from an RNA molecule refers to the translation of the RNA coding 
sequence to produce the protein or polypeptide. 

Examples of polynucleotides that inhibit the expression of a FucT or XylT, or 
both, are given below. 

5 

Sense SuDDression/CosuDpression 

In some embodiments of the invention, inhibition of the expression of FucT or 
XylT, or both, may be obtained by sense suppression or cosuppression. For 

10 cosuppression, an expression cassette is designed to express an RNA molecule 

corresponding to all or part of a messenger RNA encoding a FucT or XylT, or both, in 
the ""sense" orientation. Overexpression of the RNA molecule can result in reduced 
expression of the native gene. Accordingly, multiple plant lines transformed with the 
cosuppression expression cassette are screened to identify those that show the greatest 

1 5 inhibition of FucT or XylT expression. 

The polynucleotide used for cosuppression may correspond to all or part of the 
sequence encoding the FucT or XylT, all or part of the 5* and/or 3* untranslated region 
of a FucT or XylT transcript, or all or part of both the coding sequence and the 
untranslated regions of a transcript encoding FucT or XylT. In some embodiments 

20 where the polynucleotide comprises all or part of the coding region for the FucT or 

XylT protein, the expression cassette is designed to eliminate the start codon of the 
polynucleotide so that no protein product will be transcribed. 

Cosuppression may be used to inhibit the expression of plant genes to produce 
plants having undetectable protein levels for the proteins encoded by these genes. 

25 See, for example, Broin et aL (2002) Plant Cell 14: 1417-1432. Cosuppression may 

also be used to inhibit the expression of multiple proteins in the same plant. See, for 
example, U.S. Patent No. 5,942,657. Methods for using cosuppression to inhibit the 
expression of endogenous genes in plants are described in Flavell et al (1994) Proc, 
Natl Acad, ScL USA 91:3490-3496; Jorgensen etaL (1996) Plant Moi Biol 31:957- 

30 973; Johansen and Carrington (2001) Plant Physiol 126:930-938; Broin et al (2002) 

Plant Cell 14:1417-1432; Stoutjesdijk a/. (2002) Plant Physiol 129:1723-1731; Yu 
et al (2003) Phytochemistry 63:753-763; and U.S. Patent Nos. 5,034,323, 5,283,184, 
and 5,942,657; each of which is herein incorporated by reference. The efficiency of 
cosuppression may be increased by including a poly-dT region in the expression 
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cassette at a position 3' to the sense sequence and 5* of the polyadenylation signal. 
See, U.S. Patent Publication No. 20020048814, herein incorporated by reference. 
Typically, such a nucleotide sequence has substantial sequence identity to the 
sequence of the transcript of the endogenous gene, optimally greater than about 65% 
5 sequence identity, more optimally greater than about 85% sequence identity, most 

optimally greater than about 95% sequence identity. See, U.S. Patent Nos. 5,283,184 
and 5,034,323; herein incorporated by reference. 

Antisense Suppression 

10 In some embodiments of the invention, inhibition of the expression of FucT or 

XylT, or both, may be obtained by antisense suppression. For antisense suppression, 
the expression cassette is designed to express an RNA molecule complementary to all 
or part of a messenger RNA encoding the FucT or XylT. Overexpression of the 
antisense RNA molecule can result in reduced expression of the native gene. 

15 Accordingly, multiple plant lines transformed with the antisense suppression 

expression cassette are screened to identify those that show the greatest inhibition of 
FucT or XylT expression. 

The polynucleotide for use in antisense suppression may correspond to all or 
part of the complement of the sequence encoding the FucT or XylT, all or part of the 

20 complement of the 5' and/or 3' untranslated region of the FucT or XylT transcript, or 

all or part of the complement of both the coding sequence and the untranslated regions 
of a transcript encoding the FucT or XylT. In addition, the antisense polynucleotide 
may be fully complementary (i.e., 100% identical to the complement of the target 
sequence) or partially complementary (i.e., less than 100% identical to the 

25 complement of the target sequence) to the target sequence. Antisense suppression 

may be used to inhibit the expression of multiple proteins in the same plant. See, for 
example, U.S. Patent No. 5,942,657. Furthermore, portions of the antisense 
nucleotides may be used to disrupt the expression of the target gene. Generally, 
sequences of at least 50 nucleotides, 100 nucleotides, 200 nucleotides, 300, 400, 450, 

30 500, 550, or greater may be used. Methods for using antisense suppression to inhibit 

the expression of endogenous genes in plants are described, for example, in Liu et ai 
(2002) Plant Physiol 129:1732-1743 and U.S. Patent Nos. 5,759,829 and 5,942,657, 
each of which is herein incorporated by reference. Efficiency of antisense 
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suppression may be increased by including a poly-dT region in the expression cassette 
at a position 3* to the antisense sequence and 5' of the polyadenylation signal. See, 
U.S. Patent Publication No. 20020048814, herein incorporated by reference. 



5 Double-Stranded RNA Interference 

In some embodiments of the invention, inhibition of the expression of a FucT 
or XylT, or both, may be obtained by double-stranded RNA (dsRNA) interference. 
For dsRNA interference, a sense RNA molecule like that described above for 
cosuppression and an antisense RNA molecule that is ftilly or partially 

10 complementary to the sense EINA molecule are expressed in the same cell, resulting in 

inhibition of the expression of the corresponding endogenous messenger RNA. 

Expression of the sense and antisense molecules can be accomplished by 
designing the expression cassette to comprise both a sense sequence and an antisense 
sequence. Alternatively, separate expression cassettes may be used for the sense and 

1 5 antisense sequences. Multiple plant lines transformed with the dsRNA interference 

expression cassette or expression cassettes are then screened to identify plant lines 
that show the greatest inhibition of FucT or XylT expression. Methods for using 
dsRNA interference to inhibit the expression of endogenous plant genes are described 
in Waterhouse et al (1998) Proc. Natl Acad, Set. USA 95:13959-13964, Liu et al 

20 (2002) Plant Physiol 129:1732-1743, and WO 99/49029, WO 99/53050, WO 

99/6163 1 , and WO 00/49035; each of which is herein incorporated by reference. 



Hairpin RNA Interference and Intron-Containinz Hairpin RNA Interference 
In some embodiments of the invention, inhibition of the expression of FucT or 
25 XylT, or both, may be obtained by hairpin RNA (hpRNA) interference or intron- 

containing hairpin RNA (ihpRNA) interference. These methods are highly efficient at 
inhibiting the expression of endogenous genes. See, Waterhouse and Helliwell (2003) 
Nat. Rev. Genet, 4:29-38 and the references cited therein. 

For hpRNA interference, the expression cassette is designed to express an 
30 RNA molecule that hybridizes with itself to form a hairpin structure that comprises a 

single-stranded loop region and a base-paired stem. The base-paired stem region 
comprises a sense sequence corresponding to all or part of the endogenous messenger 
RNA encoding the gene whose expression is to be inhibited, and an antisense 
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sequence that is fully or partially complementary to the sense sequence. Thus, the 
base-paired stem region of the molecule generally determines the specificity of the 
RNA interference. hpRNA molecules are highly efficient at inhibiting the expression 
of endogenous genes, and the RNA interference they induce is inherited by 
5 subsequent generations of plants. See, for example, Chuang and Meyerowitz (2000) 

Proc, Natl. Acad. Sci. USA 97:4985-4990; Stoutjesdijk et al (2002) Plant Physiol. 
129:1723-1731; and Waterhouse and Helliwell (2003) A^a/. Rev. Genet. 4:29-38. 
Methods for using hpRNA interference to inhibit or silence the expression of genes 
are described, for example, in Chuang and Meyerowitz (2000) Proc. Natl Acad. Sci. 

10 USA 97:4985-4990; Stoutjesdijk et al (2002) Plant Physiol 129: 1723-173 1; 

Waterhouse and Helliwell (2003) Nat. Rev. Genet. 4:29-38; Pandolfmi et al BMC 
Biotechnology 3:7, and U.S. Patent Publication No. 20030175965; each of which is 
herein incorporated by reference. A transient assay for the efficiency of hpRNA 
constructs to silence gene expression in vivo has been described by Panstruga et al 

1 5 (2003) Mol Biol Rep. 30: 1 35- 140, herein incorporated by reference. 

For ihpRNA, the interfering molecules have the same general structure as for 
hpRNA, but the RNA molecule additionally comprises an intron that is capable of 
being spliced in the cell in which the ihpRNA is expressed. The use of an intron 
minimizes the size of the loop in the hairpin RNA molecule following splicing, and 

20 this increases the efficiency of interference. See, for example. Smith et al (2000) 

Nature 407:319-320. In fact. Smith et al show 100% suppression of endogenous 
gene expression using ihpRNA-mediated interference. Methods for using ihpRNA 
interference to inhibit the expression of endogenous plant genes are described, for 
example, in Smith et al (2000) Nature 407:3 19-320; Wesley et al (2001) Plant J. 

25 27:581-590; Wang and Waterhouse (2001) Curr. Opin. Plant Biol 5:146-150; 

Waterhouse and Helliwell (2003) Nat. Rev, Genet 4:29-38; Helliwell and Waterhouse 
(2003) Methods 30:289-295, and U.S. Patent Publication No. 20030180945, each of 
which is herein incorporated by reference. 

The expression cassette for hpRNA interference may also be designed such 

30 that the sense sequence and the antisense sequence do not correspond to an 

endogenous RNA. In this embodiment, the sense and antisense sequence flank a loop 
sequence that comprises a nucleotide sequence corresponding to all or part of the 
endogenous messenger RNA of the target gene. Thus, it is the loop region that 
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determines the specificity of the RNA interference. See, for example, WO 02/00904, 
herein incorporated by reference. 

Transcriptional gene silencing (TGS) may be accomplished through use of 
hpRNA constructs wherein the inverted repeat of the hairpin shares sequence identity 
5 with the promoter region of a gene to be silenced. Processing of the hpRNA into 

short RNAs which can interact with the homologous promoter region may trigger 
degradation or methylation to result in silencing (Aufsatz et ai (2002) PNAS 99 
(Suppl 4):16499-16506; Mettee^ al (2000) EMBO J 19(19):5 194-5201). 

Expression cassettes that are designed to express an RNA molecule that forms 

10 a hairpin structure are referred to herein as RNAi expression cassettes. In some 

embodiments, the RNAi expression cassette is designed in accordance with a strategy 
outlined in Figure 28. In such embodiments, an RNAi expression cassette can be 
designed to suppress the expression of the individual FucT and XylT genes (i.e., each 
cassette provides a single gene knockout), or can be designed to suppress the 

1 5 expression of both the FucT and XylT genes (i.e., a single RNAi expression cassette 

expresses an inhibitory molecule that provides for suppression of expression of both 
of these genes). Where the RNAi expression cassette suppresses expression of both 
the FucT and XylT genes, it is referred to herein as a "chimeric" RNAi expression 
cassette. The single-gene and chimeric RNAi expression cassettes can be designed to 

20 express larger hpRNA structures or, alternatively, small hpBlNA structures, as noted 

herein below. 

Thus, in some embodiments, the RNAi expression cassette is designed to 
express larger hpRNA structures having sufficient homology to the target mRNA 
transcript to provide for post-transcriptional gene silencing of one or both of a FucT 

25 and XylT gene. For larger hp RNA structures, the sense strand of the RNAi 

expression cassette is designed to comprise in the 5 -to-3* direction the following 
operably linked elements: a promoter of interest, a forward fragment of the FucT or 
XylT gene sequence comprising about 500 to about 800 nucleotides (nt) of a sense 
strand for FucT or XylT, respectively, a spacer sequence comprising about 100 to 

30 about 700 nt of any sequence as noted herein below, and a reverse fragment of the 

XylT or FucT gene sequence, wherein the reverse fragment comprises the antisense 
sequence complementary to the respective (i.e., FucT or XylT) forward fragment. 
Thus, for example, if a forward fragment is represented by nucleotides . .acttg. . .", 



-48- 



wo 2007/084922 



PCTAJS2007/060642 



the corresponding reverse fragment is represented by nucleotides "...caagt...", and 
the sense strand for such an RNAi expression cassette would comprise the following 
sequence: "5 . .acttg. . .nnnn...caagt. . .-3*. where "nnnn" represents the spacer 
sequence. 

S It is recognized that the forward fragment can comprise a nucleotide sequence 

that is 100% identical to the corresponding portion of the sense strand for the target 
FucT or XylT gene sequence, or in the alternative, can comprise a sequence that 
shares at least 90%, at least 95%, or at least 98% sequence identity to the 
corresponding portion of the sense strand for the target FucT or XylT gene to be 

10 silenced. In like manner, it is recognized that the reverse fragment does not have to 

share 100% sequence identity to the complement of the forward fragment; rather it 
must be of sufficient length and sufficient complementarity to the forward fragment 
sequence such that when the inhibitory RNA molecule is expressed, the transcribed 
regions corresponding to the forward fragment and reverse fragment will hybridize to 

1 5 form the base-paired stem (i.e., double-stranded portion) of the hp RNA structure. By 

"sufficient length" is intended a length that is at least 10%. at least 15%, at least 20%, 
at least 30%, at least 40% of the length of the forward fragment, more frequently at 
least 50%. at least 75%. at least 90%, or least 95% of the length of the forward 
fragment. By "sufficient complementarity" is intended the sequence of the reverse 

20 fragment shares at least 90%, at least 95%, at least 98% sequence identity with the 

complement of that portion of the forward fragment that will hybridize with the 
reverse fragment to form the base-paired stem of the hp RNA structure. Thus, in 
some embodiments, the reverse fragment is the complement (i.e., antisense version) 
of the forward fragment. 

25 In designing such an RNAi expression cassette, the lengths of the forward 

fragment, spacer sequence, and reverse fragments are chosen such that the combined 
length of the polynucleotide that encodes the hpRNA construct is about 650 to about 
2500 nt, about 750 to about 2500 nt, about 750 to about 2400 nt, about 1000 to about 
2400 nt, about 1200 to about 2300 nt, about 1250 to about 2100 nt, or about 1500 to 

30 about 1800. In some embodiments, the combined length of the expressed hairpin 

construct is about 650 nt, about 700 nt, about 750 nt, about 800 nt, about 850 nt, about 
900 nt, about 950 nt, about 1000 nt, about 1050 nt, about 1 100 nt, about 1 150 nt, 
about 1200 nt, about 1250 nt, about 1300 nt, about 1350 nt, about 1400 nt, about 1450 
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nt, about 1500 nt, about 1550 nt, about 1600 nt, about 1650 nt, about 1700 nt, about 
1750 nt, about 1800 nt, about 1850 nt, about 1900 nt, about 1950 nt. about 2000 nt, 
about 2050 nt, about 2100 nt, about 2150 nt, about 2200 nt, about 2250 nt, about 2300 
nt, or any such length between about 650 nt to about 2300 nt. 

5 In some embodiments, the forward fragment comprises about 500 to about 800 

nt, for example, 500, 525, 550, 575, 600, 625, 650, 675, 700, 725, 750, 775, or 800 nt 
of a sense strand for FucT or XylT, for example, of the sense strand set forth in SEQ 
ID NO: 1 or 2 (FucT) or SEQ ID N0:4, 5, 1 9, or 20 (XylT); the spacer sequence 
comprises about 100 to about 700 nt, for example, 100, 125, 150, 175, 200, 225, 250, 

10 275, 300, 325. 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 675, 

or 700 nt of any sequence as noted below, and the reverse fragment comprises the 
antisense strand for the forward fragment sequence, or a sequence having sufficient 
length and sufficient complementarity to the forward fragment sequence. 

The spacer sequence can be any sequence that has insufficient homology to 

15 the target gene, i.e.. FucT or XylT, and insufficient homology to itself such that the 

portion of the expressed inhibitory RNA molecule corresponding to the spacer region 
fails to self-hybridize, and thus forms the loop of the hairpin RNA structure. In some 
embodiments, the spacer sequence comprises an intron, and thus the expressed 
inhibitory RNA molecule forms an ihpRNA as noted herein above. In other 

20 embodiments, the spacer sequence comprises a portion of the sense strand for the 

FucT or XylT gene to be silenced, for example, a portion of the sense strand set forth 
in SEQ ID NO: 1 or 2 (FucT) or SEQ ID N0:4, 5, 19, or 20 (XylT), particulariy a 
portion of the sense strand immediately downstream from the forward fragment 
sequence. 

25 The operably linked promoter can be any promoter of interest that provides for 

expression of the operably linked inhibitory polynucleotide within the plant of 
interest, including one of the promoters disclosed herein below. The regulatory 
region can comprise additional regulatory elements that enhance expression of the 
inhibitory polynucleotide, including, but not limited to, the 5' leader sequences and 5* 

30 leader sequences plus plant introns discussed herein below. 

In one embodiment, the RNAi expression cassette is designed to suppress 
expression of the FucT polypeptide of SEQ ID N0:3, a biologically active variant of 
the FucT polypeptide of SEQ ID N0:3, or a FucT polypeptide encoded by a sequence 
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having at least 75% sequence identity to the sequence of SEQ ID NO: 1 or SEQ ID 
N0;2, for example, at least 75%, at least 80%, at least 85%, at least 90%. or at least 
95% sequence identity to the sequence of SEQ ID NO: 1 or SEQ ID N0:2. In this 
manner, the sense strand of the RNAi expression cassette is designed to comprise in 
5 the 5'-to-3' direction the following operably linked elements: a promoter of interest; a 

forward fragment of the FucT gene sequence, wherein the forward fragment 
comprises nt 255-985 of SEQ ID NO: 1; a spacer sequence comprising about 100 to 
about 700 nt of any sequence as noted above; and a reverse fragment of the FucT gene 
sequence, wherein the reverse fragment comprises the complement (i.e., antisense 

10 version) of nt 255-985 of SEQ ID NO: 1 . In one such embodiment, the spacer 

sequence is represented by nt 986-1444 of SEQ ID N0:1, and the total length of that 
portion of the sense strand of the RNAi expression cassette corresponding to the 
coding sequence for the hpRNA structure is 1918 nt. Stably transforming a plant with 
a nucleotide construct comprising this RNAi expression cassette, for example, the 

1 5 vector shown in Figure 8, effectively inhibits expression of FucT within the plant cells 

of the plant in which the hpRNA structure is expressed. In one embodiment, the plant 
of interest is a member of the duckweed family, for example, a member of the 
Lemnaceae, and the plant has been stably transformed with the vector shown in 
Figure 8. 

20 In other embodiments of the invention, the RNAi expression cassette is 

designed to suppress expression of the XylT polypeptide of SEQ ID NO:6 or SEQ ID 
N0:21, a biologically active variant of the XylT polypeptide of SEQ ID N0:6 or SEQ 
ID N0:21, or a XylT polypeptide encoded by a sequence having at least 75% 
sequence identity to the sequence of SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 19, or 

25 SEQ ID NO:20, for example, at least 75%, at least 80%, at least 85%, at least 90%, or 

at least 95% sequence identity to the sequence of SEQ ID N0:4, SEQ ID N0:5, SEQ 
ID NO: 19, or SEQ ID NO:20. In this manner, the sense strand of the RNAi 
expression cassette is designed to comprise in the 5'-to-3' direction the following 
operably linked elements: a promoter of interest; a forward fragment of the XylT 

30 gene sequence, wherein the forward fragment comprises nt 3 1 8-1 052 of SEQ ID 

N0:4; a spacer sequence comprising about 100 to about 700 nt of any sequence as 
noted above; and a reverse fragment of the XylT gene sequence, wherein the reverse 
fragment comprises the complement (i.e., antisense version) of nt 3 18-1052 of SEQ 
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IDN0:4. In one such embodiment, the spacer sequence is represented by nt 1053- 
1599 of SEQ ID N0:4, and the total length of that portion of the sense strand of the 
KNAi expression cassette corresponding to the coding sequence for the hpRNA 
structure is 201 5 nt. Stably transforming a plant with a nucleotide construct 
5 comprising this RNAi expression cassette, for example, the vector shown in Figure 9, 

effectively inhibits expression of FucT within the plant cells of the plant in which the 
hpRNA structure is expressed. In one embodiment, the plant of interest is a member 
of the duckweed family, for example, a member of the Lemnaceae, and the plant has 
been stably transformed with the vector shown in Figure 9. 

10 In other embodiments, the sense strand of the RNAi expression cassette is 

designed to comprise in the 5'-to-3' direction the following operably linked elements: 
a promoter of interest; a forward fragment of the XylT gene sequence, wherein the 
forward fragment comprises nt 1-734 of SEQ ID NO: 19; a spacer sequence 
comprising about 100 to about 700 nt of any sequence as noted above; and a reverse 

1 5 fragment of the XylT gene sequence, wherein the reverse fragment comprises the 

complement (i.e., antisense version) of nt 1-734 of SEQ ID NO: 19. In one such 
embodiment, the spacer sequence is represented by nt 735-1282 of SEQ ID NO: 19, 
and the total length of that portion of the sense strand of the RNAi expression cassette 
corresponding to the coding sequence for the hpRNA structure is 2015 nt. Stably 

20 transforming a plant with a nucleotide construct comprising this RNAi expression 

cassette, for example, the vector shown in Figure 9, effectively inhibits expression of 
FucT within the plant cells of the plant in which the hpRNA structure is expressed. In 
one embodiment, the plant of interest is a member of the duckweed family, for 
example, a member of the Lemnaceae, and the plant has been stably transformed with 

25 the vector shown in Figure 9. 

In yet other embodiments, larger hpRNA structures can be designed such that 
the antisense and sense sequences are in opposite orientation. In this manner, the 
sense strand of the RNAi expression cassette is designed to comprise in the 5'-to-3' 
direction the following operably linked elements: a promoter of interest, the fuU- 

30 length FucT or XylT gene sequence in the antisense orientation, and a forward 

fragment of the FucT or XylT gene sequence comprising the 3 -half of the sequence in 
the sense orientation (see Figure 28, design 1). In this type of construct, the 3'-half of 
the sequence forms the base-paired (i.e., double-stranded) stem of the hpRNA, and the 
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5'-half of the sequence acts as a spacer sequence. Without being bound by any theory 
or mechanism, the 3' region of the FucT or XylT sequence is chosen to form the 
double-stranded region of the hpRNA for for this construct because this region is 
relatively conserved among different plant species compared to the 5' region. 
S In one such embodiment, the RNAi expression cassette is designed to suppress 

expression of the FucT polypeptide of SEQ ID N0:3, a biologically active variant of 
the FucT polypeptide of SEQ ID N0:3, or a FucT polypeptide encoded by a sequence 
having at least 75% sequence identity to the sequence of SEQ ID NO: I or SEQ ID 
N0:2, for example, at least 75%, at least 80%, at least 85%. at least 90%, or at least 

10 95% sequence identity to the sequence of SEQ ID NO: 1 or SEQ ID N0:2. In this 

manner, the sense strand of the RNAi expression cassette is designed to comprise in 
the 5'-to-3' direction the following operably linked elements: a promoter of interest; 
nucleotides 1-1865 of SEQ ID N0:1 in antisense orientation, and nucleotides 950- 
1 865 of SEQ ID NO: 1 in the sense orientation. Stably transforming a plant with a 

15 nucleotide construct comprising this RNAi expression cassette effectively inhibits 

expression of FucT within the plant cells of the plant in which the hpRNA structure is 
expressed. In one embodiment, the plant of interest is a member of the duckweed 
family, for example, a member of the Lemnaceae. 

In another such embodiment, the RNAi expression cassette is designed to 

20 suppress expression of the XylT polypeptide of SEQ ID N0:6 or SEQ ID N0:21, a 

biologically active variant of the XylT polypeptide of SEQ ID N0:6 or SEQ ID 
N0:21, or a XylT polypeptide encoded by a sequence having at least 75% sequence 
identity to the sequence of SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 19, or SEQ ID 
NO:20, for example, at least 75%, at least 80%, at least 85%, at least 90%, or at least 

25 95% sequence identity to the sequence of SEQ ID NO: 5, SEQ ID NO: 1 9, or SEQ ID 

NO:20. In this manner, the sense strand of the RNAi expression cassette is designed 
to comprise in the 5*-to-3' direction the following operably linked elements: a 
promoter of interest, nucleotides 1-1860 of SEQ ID N0:4 in antisense orientation, and 
nucleotides 950-1860 of SEQ ID N0:4 in the sense orientation. Stably transforming a 

30 plant with a nucleotide construct comprising this RNAi expression cassette effectively 

inhibits expression of XylT within the plant cells of the plant in which the hpRNA 
structure is expressed. In one embodiment, the plant of interest is a member of the 
duckweed family, for example, a member of the Lemnaceae. 
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In another such embodiment, the sense strand of the RNAi expression cassette 
is designed to comprise in the 5'-to-3' direction the following operably linked 
elements: a promoter of interest, nucleotides 1-1282 of SEQ ID N0:19 in antisense 
orientation, and nucleotides 652-1282 of SEQ ID NO: 19 in the sense orientation. 
5 Stably transforming a plant with a nucleotide construct comprising this RNAi 

expression cassette effectively inhibits expression of XylT within the plant cells of the 
plant in which the hpRNA structure is expressed. In one embodiment, the plant of 
interest is a member of the duckweed family, for example, a member of the 
Lemnaceae. 

10 Where suppression of both the FucT and XylT proteins is desired, it can be 

achieved by introducing these single-gene RNAi expression cassettes into the plant in 
a single transformation event, for example, by assembling these single-gene RNAi 
expression cassettes within a single transformation vector, for example, a vector 
similar to that shown in Figure 1 1, or as separate co-transformation events, for 

15 example, by assembling these single-gene RNAi expression cassettes within two 

transformation vectors, for example, vectors similar to those shown in Figures 8 and 
9, using any suitable transformation method known in the art. including but not 
limited to the transformation methods disclosed elsewhere herein. 

Alternatively, suppression of both the FucT and XylT proteins can be achieved 

20 by introducing into the higher plant of interest a chimeric RNAi expression cassette as 

noted herein above. Thus, in some embodiments of the invention, the sense strand of 
a chimeric RNAi expression cassette is designed to comprise in the 5'-to-3* direction 
the following operably linked elements: a promoter of interest; a chimeric forward 
fragment, comprising about 500 to about 650 nucleotides (nt) of a sense strand for 

25 FucT and about 500 to about 650 nt of a sense strand for XylT, wherein the FucT 

sequence and XylT sequence can be in either order; a spacer sequence comprising 
about 100 to about 700 nt of any sequence; and a reverse fragment of the chimeric 
forward fragment, wherein the reverse fragment comprises the antisense sequence 
complementary to the respective chimeric forward fragment. 

30 As previously noted for the individual RNAi expression cassettes, it is 

recognized that the individual FucT or XlyT sequence within the chimeric forward 
fragment can comprise a nucleotide sequence that is 100% identical to the 
corresponding portion of the sense strand for the target FucT and XylT gene 
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sequence, respectively, or in the alternative, can comprise a sequence that shares at 
least 90%, at least 95%, or at least 98% sequence identity to the corresponding portion 
of the sense strand for the target FucT or XylT gene to be silenced. In like manner, it 
is recognized that the reverse fragment does not have to share 100% sequence identity 
5 to the complement of the chimeric forward fragment; rather it must be of sufficient 

length and sufficient complementarity to the chimeric forward fragment sequence, as 
defined herein above, such that when the inhibitory RNA molecule is expressed, the 
transcribed regions corresponding to the chimeric forward fragment and reverse 
fragment will hybridize to form the base-paired stem (i.e., double-stranded portion) of 

10 the hpRNA structure. In designing such a chimeric RNAi expression cassette, 

the lengths of the forward fragment, spacer sequence, and reverse fragments are 
chosen such that the combined length of the polynucleotide that encodes the hpRNA 
structure is about 1200 to about 3300 nt, about 1250 to about 3300 nt, about 1300 to 
about 3300 nt, about 1350 to about 3300 nt, about 1400 to about 3300 nt, about 1450 

15 nt to about 3 300 nt, about 1 500 to about 3300 nt, about 2200 to about 3 1 00 nt, about 

2250 to about 2800 nt, or about 2500 to about 2700 nt. In some embodiments, the 
combined length of the expressed hairpin construct is about 1 200 nt, about 1250 nt, 
about 1300 nt, about 1350 nt, about 1400 nt, about 1450 nt, about 1500 nt, about 1800 
nt, about 2200 nt, about 2250 nt, about 2300 nt, about 2350 nt, about 2400 nt. about 

20 2450 nt, about 2500 nt, about 2550 nt, about 2600 nt, about 2650 nt, about 2700 nt, 

about 2750 nt, about 2800 nt, about 2850 nt, about 2900 nt, about 2950 nt, about 3000 
nt, about 3050 nt, about'3100 nt, about 3150 nt, about 3200 nt, about 3250 nt, about 
3300 nt, or any such length between about 1200 nt to about 3300 nt. 

In some embodiments, the chimeric forward fragment comprises about 500 to 

25 about 650 nt, for example, 500, 525, 550, 575, 600, 625, or 650 nt, of a sense strand 

for FucT, for example, of the sense strand set forth in SEQ ID NO: 1 or 2, and about 
500 to about 650 nt, for example, 500, 525, 550, 575, 600, 625, or 650 nt, of a sense 
strand for XylT, for example, of the sense strand set forth in SEQ ID N0:4, 5, 19, or 
20, where the FucT and XylT sequence can be fused in either order; the spacer 

30 sequence comprises about 100 to about 700 nt, for example, 100, 200, 225, 250, 275, 

300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 675, or 700 
nt of any sequence of interest; and the reverse fragment comprises the antisense strand 
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for the chimeric forward fragment sequence, or a sequence having sufficient length 
and sufficient complementarity to the chimeric forward fragment sequence. 

As noted above for the single-gene RNAi expression cassettes, the spacer 
sequence can be any sequence that has insufficient homology to the target gene, i.e., 

5 FucT or XylT, and insufficient homology to itself such that the portion of the 

expressed inhibitory RNA molecule corresponding to the spacer region fails to self- 
hybridize, and thus forms the loop of the hpRNA structure. In some embodiments, 
the spacer sequence comprises an intron, and thus the expressed inhibitory RNA 
molecule forms an ihpRNA as noted herein above. In other embodiments, the spacer 

10 sequence comprises a portion of the sense strand for the FucT or XylT gene to be 

silenced, for example, a portion of the sense strand set forth in SEQ ID N0:1 or 2 
(FucT) or SEQ ID N0:4, 5, 19, or 20 (XylT). In one embodiment, the chimeric 
forward fragment comprises the FucT and XylT sequence fused in that order, and the 
spacer sequence comprises a portion of the XylT sense strand immediately 

15 downstream from the XylT sequence contained within the chimeric forward fragment. 

In another embodiment, the chimeric forward fragment comprises the XylT and FucT 
sequence fused in that order, and the spacer sequence comprises a portion of the FucT 
sense strand immediately downstream from the FucT sequence contained within the 
chimeric forward fragment, 

20 In some embodiments, the chimeric RNAi expression cassette is designed to 

suppress expression of the FucT polypeptide of SEQ ID NO:3, a biologically active 
variant of the FucT polypeptide of SEQ ID N0:3, or a FucT polypeptide encoded by a 
sequence having at least 75% sequence identity to the sequence of SEQ ID NO: 1 or 
SEQ ID N0:2, for example, at least 75%, at least 80%, at least 85%, at least 90%, or 

25 at least 95% sequence identity to the sequence of SEQ ID NO: 1 or SEQ ID N0:2, and 

to suppress expression of the XylT polypeptide of SEQ ID N0:6 or SEQ ID N0:21, a 
biologically active variant of the XylT polypeptide of SEQ ID N0:6 or SEQ ID 
N0:21, or a XylT polypeptide encoded by a sequence having at least 75% sequence 
identity to the sequence of SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 19, or SEQ ID 

30 NO:20, for example, at least 75%, at least 80%, at least 85%, at least 90%, or at least 

95% sequence identity to the sequence of SEQ ID N0:4, SEQ ID NO:5, SEQ ID 
NO: 19, or SEQ ID NO:20. For some of these embodiments, the FucT sequence 
within the chimeric forward fragment is chosen such that it corresponds to nt 700 to nt 
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1400 of SEQ ID NO: 1 or SEQ ID N0:2, and/or the XylT sequence within the 
chimeric forward fragment is chosen such that it corresponds to nt 700 to nt 1400 of 
SEQ ID N0:4 or SEQ ID N0:5, or nt 3 83 to 1 083 of SEQ ID NO: 1 9 or 20. Without 
being bound by theory, it is believed that this region (particularly for FucT) is 
5 relatively conserved among different plant species, and therefore is a potentially good 

target. 

In other embodiments, the sense strand of the chimeric RNAi expression 
cassette is designed to comprise in the 5*-to-3* direction the following operably linked 
elements: a promoter of interest; a chimeric forward fragment comprising nt 254-855 

10 of SEQ ID NO: 1 (FucT sequence) and nt 3 18-943 of SEQ ID N0:4 (XlyT sequence); 

a spacer sequence comprising about 100 to about 700 nt of any sequence as noted 
above; and a reverse fragment comprising the complement (i.e., antisense version) of 
the chimeric forward fragment, i.e., comprising the complement of nt 318-943 of SEQ 
ID N0:4 and the complement of nt 254-855 of SEQ ID NO: 1, In a particular 

1 5 embodiment, the spacer sequence within this chimeric RNAi expression cassette is 

represented by nt 944-1443 of SEQ ID N0:4, and the total length of that portion of 
the sense strand of the RNAi expression cassette corresponding to the coding 
sequence for the hpRNA structure is 2956 nt. 

In another such embodiment, the sense strand of the chimeric RNAi 

20 expression cassette is designed to comprise in the 5 -to-3' direction the following 

operably linked elements: a promoter of interest; a chimeric forward fragment 
comprising nt 318-943 of SEQ ID N0:4 (XlyT sequence) and nt 254-855 of SEQ ID 
NO: 1 (FucT sequence); a spacer sequence comprising about 100 to about 700 nt of 
any sequence as noted above; and a reverse fragment comprising the complement 

25 (i.e., antisense version) of the chimeric forward fragment, i.e., comprising the 

complement of nt 254-855 of SEQ ID NO: 1 and the complement of nt 3 1 8-943 of 
SEQ ID N0:4. in a particular embodiment, the spacer sequence within this chimeric 
RNAi expression cassette is represented by nt 856-1355 of SEQ ID N0:1, and the 
total length of that portion of the sense strand of the RNAi expression cassette 

30 corresponding to the coding sequence for the hpRNA structure is 2956 nt. 

In yet other embodiments, the sense strand of the chimeric ElNAi expression 
cassette is designed to comprise in the 5'-to-3' direction the following operably linked 
elements: a promoter of interest; a chimeric forward fragment comprising nt 254-855 
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of SEQ ID N0:1 (FucT sequence) and nt 1-626 of SEQ ID NO: 19 (XlyT sequence); a 
spacer sequence comprising about 100 to about 700 nt of any sequence as noted 
above; and a reverse fragment comprising the complement (i.e., antisense version) of 
the chimeric forward fragment, i.e., comprising the complement of nt 1-626 of SEQ 

5 ID NO: 1 9 and the complement of nt 254-855 of SEQ ID NO: 1 . In a particular 

embodiment, the spacer sequence within this chimeric RNAi expression cassette is 
represented by nt 627-1 126 of SEQ ID NO: 19, and the total length of that portion of 
the sense strand of the RNAi expression cassette corresponding to the coding 
sequence for the hpRNA structure is 2956 nt. 

10 In another such embodiment, the sense strand of the chimeric RNAi 

expression cassette is designed to comprise in the 5 -to-3' direction the following 
operably linked elements: a promoter of interest; a chimeric forward fragment 
comprising nt 1-626 of SEQ ID NO: 19 (XlyT sequence) and nt 254-855 of SEQ ID 
N0:1 (FucT sequence); a spacer sequence comprising about 100 to about 700 nt of 

15 any sequence as noted above; and a reverse fragment comprising the complement 

(i.e., antisense version) of the chimeric forward fragment, i.e., comprising the 
complement of nt 254-855 of SEQ ID N0:1 and the complement of nt 1-626 of SEQ 
ID NO: 19. In a particular embodiment, the spacer sequence within this chimeric 
RNAi expression cassette is represented by nt 856-1355 of SEQ ID N0:1, and the 

20 total length of that portion of the sense strand of the RNAi expression cassette 

corresponding to the coding sequence for the hpRNA structure is 2956 nt. 

Stably transforming a plant with a nucleotide construct comprising a chimeric 
RNAi expression cassette described herein, for example, stable transformation with 
the vector shown in Figure 10, effectively inhibits expression of both FucT and XylT 

25 within the plant cells of the plant in which the hpRNA structure is expressed. In one 

embodiment, the plant of interest is a member of the duckweed family, for example, a 
member of the Lemnaceae, and the plant has been stably transformed with the vector 
shown in Figure 10. 

It is recognized that the plant can be stably transformed with at least two of 

30 these chimeric RNAi expression cassettes to provide for very efficient gene silencing 

of the FucT and XylT proteins, including silencing of any isoforms of these two 
proteins. See, for example, the two orientations provided in "possible design 2" of 
Figure 28. In this manner, the plant can be stably transformed with a first chimeric 
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RNAi expression cassette wherein the chimeric forward fragment comprises the FucT 
and XylT sequence fused in that order, and the spacer sequence comprises a portion 
of the XylT sense strand immediately downstream from the XylT sequence contained 
within the chimeric forward fragment; and with a second chimeric RNAi expression 
5 cassette wherein the chimeric forward fragment comprises the XylT and FucT 

sequence fused in that order, and the spacer sequence comprises a portion of the FucT 
sense strand immediately downstream from the FucT sequence contained within the 
chimeric forward fragment. 

The operably linked promoter within any of the RNAi expression cassettes 

1 0 encoding large hpRN A structures, or large ihpRN A structures can be any promoter of 

interest that provides for expression of the operably linked inhibitory polynucleotide 
within the plant of interest, including one of the promoters disclosed herein below. 
The regulatory region can comprise additional regulatory elements that enhance 
expression of the inhibitory polynucleotide, including, but not limited to, the 5* leader 

1 5 sequences and S' leader sequences plus plant introns discussed herein below. 

In yet other embodiments, the RNAi expression cassette can be designed to 
provide for expression of small hpRNA structures having a base-paired stem region 
comprising about 200 base pairs or less. Expression of the small hpRNA structure is 
preferably driven by a promoter recognized by DNA-dependent RNA polymerase IE. 

20 See, for example, U.S. Patent Application No. 2004023 1016, herein incorporated by 

reference in its entirety. 

In this manner, the RNAi expression cassette is designed such that the 
transcribed DNA region encodes an RNA molecule comprising a sense and antisense 
nucleotide region, where the sense nucleotide sequence comprises about 19 

25 contiguous nucleotides having about 90% to about 100% sequence identity to a 

nucleotide sequence of about 19 contiguous nucleotides from the RNA transcribed 
from the gene of interest and where the antisense nucleotide sequence comprises 
about 19 contiguous nucleotides having about 90% to about 100% sequence identity 
to the complement of a nucleotide sequence of about 19 contiguous nucleotides of the 

30 sense sequence. The sense and antisense nucleotide sequences of the RNA molecule 

should be capable of forming a base-paired (i.e., double-stranded) stem region of 
RNA of about 19 to about 200 nucleotides, alternatively about 21 to about 90 or 100 
nucleotides, or alternatively about 40 to about 50 nucleotides in length. However, the 
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length of the base-paired stem region of the RNA molecule may also be about 30, 
about 60, about 70 or about 80 nucleotides in length. Where the base-paired stem 
region of the RNA molecule is larger than 19 nucleotides, there is only a requirement 
that there is at least one double-stranded region of about 19 nucleotides (wherein there 
5 can be about one mismatch between the sense and antisense region) the sense strand 

of which is "identical" (allowing for one mismatch) with 19 consecutive nucleotides 
of the target FucT or XylT polynucleotide of interest. The transcribed DNA region of 
this type of RNAi expression cassette may comprise a spacer sequence positioned 
between the sense and antisense encoding nucleotide region. The spacer sequence is 

10 not related to the targeted FucT or XylT polynucleotide, and can range in length from 

3 to about 100 nucleotides or alternatively from about 6 to about 40 nucleotides. This 
type of RNAi expression cassette also comprises a terminator sequence recognized by 
the RNA polymerase HI, the sequence being an oligo dT stretch, positioned 
downstream from the antisense-encoding nucleotide region of the cassette. By "oligo 

15 dT stretch" is a stretch of consecutive T-residues. It should comprise at least 4 T- 

residues, but obviously may contain more T-residues. 

It is recognized that in designing the short hpRNA, the fragments of the 
targeted gene sequence (i.e., fragments of FucT or XylT gene sequence) and any 
spacer sequence to be included within the hpRNA-encoding portion of the RNAi 

20 expression cassette are chosen to avoid GC-rich sequences, particularly those with 

three consecutive G/C's, and to avoid the occurrence of four or more consecutive T's 
or A's, as the string "TTTT. . ." serves as a terminator sequence recognized by the 
RNA polymerase ID. 

Thus, where gene silencing with a short hpRNA is desired, the RNAi 

25 expression cassette can be designed to comprise in the 5 -to-3* direction the following 

operably linked elements: a promoter recognized by a DNA dependent RNA 
polymerase IE of the plant cell, as defined herein below; a DNA fragment comprising 
a sense and antisense nucleotide sequence, wherein the sense nucleotide sequence 
comprises at least 19 contiguous nucleotides having about 90% to about 100% 

30 sequence identity to a nucleotide sequence of at least 1 9 contiguous nucleotides from 

the sense strand of the FucT or XylT gene of interest, and wherein the antisense 
nucleotide sequence comprises at least 19 contiguous nucleotides having about 90% 
to about 100% sequence identity to the complement of a nucleotide sequence of at 
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least 19 contiguous nucleotides of the sense sequence, wherein the sense and 
antisense nucleotide sequence are capable of forming a double-stranded RNA of 
about 19 to about 200 nucleotides in length; and an oligo dT stretch comprising at 
least 4 consecutive T-residues. 
5 In some embodiments of the invention, the RNAi expression cassette is 

designed to express a small hpRNA that suppresses expression of the FucT 
polypeptide of SEQ ID N0:3, a biologically active variant of the FucT polypeptide of 
SEQ ID N0:3, or a FucT polypeptide encoded by a sequence having at least 90% 
sequence identity to the sequence of SEQ ID NO: 1 or SEQ ID N0:2. In this manner, 

10 the RNAi expression cassette can be designed to comprise in the 5'-to-3' direction the 

following operably linked elements: a promoter recognized by a DNA dependent 
RNA polymerase HI of the plant cell, as defined herein below; a DNA fragment 
comprising a sense and antisense nucleotide sequence, wherein the sense nucleotide 
sequence comprises at least 19 contiguous nucleotides having about 90% to about 

IS 100% sequence identity to a nucleotide sequence of at least 19 contiguous nucleotides 

of SEQ ID NO: 1, and wherein the antisense nucleotide sequence comprises at least 19 
contiguous nucleotides having about 90% to about 100% sequence identity to the 
complement of a nucleotide sequence of at least 19 contiguous nucleotides of the 
sense sequence, wherein the sense and antisense nucleotide sequence are capable of 

20 forming a double-stranded RNA of about 19 to about 200 nucleotides in length; and 

an oligo dT stretch comprising at least 4 consecutive T-residues. 

In other embodiments of the invention, the RNAi expression cassette is 
designed to express a small hpRNA that suppresses expression of the XylT 
polypeptide of SEQ ID N0:6 or SEQ ID N0:21, a biologically active variant of the 

25 XylT polypeptide of SEQ ID N0:6 or SEQ ID N0:2 1 , or a XylT polypeptide encoded 

by a sequence having at least 90% sequence identity to the sequence of SEQ ID N0:4, 
SEQ ID N0:5, SEQ ID NO: 19, or SEQ ID NO:20. In this manner, the RNAi 
expression cassette can be designed to comprise in the 5'-to-3' direction the following 
operably linked elements: a promoter recognized by a DNA dependent RNA 

30 polymerase IE of the plant cell, as defined herein below; a DNA Augment comprising 

a sense and antisense nucleotide sequence, wherein the sense nucleotide sequence 
comprises at least 19 contiguous nucleotides having about 90% to about 100% 
sequence identity to a nucleotide sequence of at least 19 contiguous nucleotides of 
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SEQ ID N0:4, and wherein the antisense nucleotide sequence comprises at least 19 
contiguous nucleotides having about 90% to about 100% sequence identity to the 
complement of a nucleotide sequence of at least 19 contiguous nucleotides of the 
sense sequence, wherein the sense and antisense nucleotide sequence are capable of 
5 forming a double stranded RNA of about 19 to about 200 nucleotides in length; and 

an oligo dT stretch comprising at least 4 consecutive T-residues. 



Amplicori'Mediated Interference 

Amplicon expression cassettes comprise a plant virus-derived sequence that 
10 contains all or part of the target gene but generally not all of the genes of the native 

virus. The viral sequences present in the transcription product of the expression 
cassette allow the transcription product to direct its own replication. The transcripts 
produced by the amplicon may be either sense or antisense relative to the target 
sequence (i.e., the messenger RNA for FucT or XylT, or both). Methods of using 
1 5 amplicons to inhibit the expression of endogenous plant genes are described, for 

example, in Angell and Baulcombe (1997) EMBOJ, 16:3675-3684, Angell and 
Baulcombe (1999) Plant J, 20:357-362, and U.S. Patent No. 6,646,805, each of which 
is herein incorporated by reference. 



20 Ribozvmes 

In some embodiments, the polynucleotide expressed by the expression cassette 
of the invention is catalytic RNA or has ribozyme activity specific for the messenger 
RNA of FucT or XylT, or both. Thus, the polynucleotide causes the degradation of 
the endogenous messenger RNA, resulting in reduced expression of the FucT or 

25 XylT, or both. This method is described, for example, in U.S. Patent No. 4,987,07 1 , 

herein incorporated by reference. 



Small Interfering RNA or Micro RNA 

In some embodiments of the invention, inhibition of the expression of FucT or 
30 XylT, or both, may be obtained by RNA interference by expression of a gene 

encoding a micro RNA (miRNA). miRNAs are regulatory agents consisting of about 
22 ribonucleotides. miRNA are highly efficient at inhibiting the expression of 
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endogenous genes. See, for example Javier et al. (2003) Nature 425: 257-263, herein 
incorporated by reference. 

For mlRNA interference, the expression cassette is designed to express an 
RNA molecule that is modeled on an endogenous miRNA gene. The miRNA gene 
5 encodes an RNA that forms a hairpin structure containing a 22-nucleotide sequence 

that is complementary to another endogenous gene (target sequence). For suppression 
of FucT or XylT expression, the 22 -nucleotide sequence is selected from a FucT or 
XylT transcript sequence and contains 22 nucleotides of said FucT or XylT sequence 
in sense orientation and 21 nucleotides of a corresponding antisense sequence that is 
10 complementary to the sense sequence. miRNA molecules are highly efficient at 

inhibiting the expression of endogenous genes, and the RNA interference they induce 
is inherited by subsequent generations of plants. 

PolvDepUde-Based Inhibition of Gene Expression 
15 In one embodiment, the polynucleotide encodes a zinc finger protein that 

binds to a gene encoding a FucT or XylT, or both, resulting in reduced expression of 
the gene. In particular embodiments, the zinc finger protein binds to a regulatory 
region of a FucT or XylT gene. In other embodiments, the zinc finger protein binds to 
a messenger RNA encoding a FucT or XylT and prevents its translation. Methods of 
20 selecting sites for targeting by zinc finger proteins have been described, for example, 

in U.S. Patent No. 6,453,242, and methods for using zinc finger proteins to inhibit the 
expression of genes in plants are described, for example, in U.S. Patent EHiblication 
No. 20030037355; each of v^hich is herein incorporated by reference. 



25 Polvpeptide-Based Inhibition of Protein Activity 

In some embodiments of the invention, the polynucleotide encodes an 
antibody that binds to at least one FucT or XylT, and reduces the activity of the FucT 
or XylT. In another embodiment, the binding of the antibody results in increased 
turnover of the antibody-FucT or XylT complex by cellular quality control 

30 mechanisms. The expression of antibodies in plant cells and the inhibition of 

molecular pathways by expression and binding of antibodies to proteins in plant cells 
are well known in the art. See, for example, Conrad and Sonnewald (2003) Nature 
Biotech. 21:35-36, incorporated herein by reference. 
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Gene Disruption 

In some embodiments of the present invention, the activity of FucT or XylT, 
or both, is reduced or eliminated by disrupting the gene encoding the FucT or XylT, 
or both. The gene encoding the FucT or XylT, or both, may be disrupted by any 
method known in the art. For example, in one embodiment, the gene is disrupted by 
transposon tagging. In another embodiment, the gene is disrupted by mutagenizing 
plants using random or targeted mutagenesis, and selecting for plants that have 
reduced FucT and/or XylT activity. 



Transposon Tassins 

In one embodiment of the invention, transposon tagging is used to reduce or 
eliminate the activity of FucT or XylT, or both. Transposon tagging comprises 
inserting a transposon within an endogenous FucT or XylT gene to reduce or 

1 5 eliminate expression of the FucT or XylT. "FucT" or "XylT" gene is intended to 

mean the gene that encodes a FucT or XylT, respectively, according to the invention. 

In this embodiment, the expression of FucT or XylT is reduced or eliminated 
by inserting a transposon within a regulatory region or coding region of the gene 
encoding the FucT or XylT. A transposon that is within an exon, intron, 5' or 3' 

20 untranslated sequence, a promoter, or any other regulatory sequence of a FucT or 

XylT, or both, gene may be used to reduce or eliminate the expression and/or activity 
of the encoded FucT or XylT. 

Methods for the transposon tagging of specific genes in plants are well known 
in the art. See, for example, Maes et al. (1999) Trends Plant Sci, 4:90-96; 

25 Dharmapuri and Sonti ( 1 999) FEMS Microbiol Lett. 1 79:53-59; Meissner et al 

(2000) Plant X 22:265-274; Phogat et al (2000) / BioscL 25:57-63; Walbot (2000) 
Curr, Opin, Plant Biol 2:103-107; Gai et al (2000) Nucleic Acids Res, 28:94-96; 
Fitzmaurice et al (1999) Genetics 153:1919-1928). In addition, the TUSC process 
for selecting Mu insertions in selected genes has been described in Bensen et al 

30 (1995) Plant Cell 7:75-84; Mena et al (1996) Science 274:1537-1540; each of which 

is herein incorporated by reference. 

The invention encompasses additional methods for reducing or eliminating the 
activity of FucT or XylT. Examples of other methods for altering or mutating a 
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genomic nucleotide sequence in a plant are known in the art and include, but are not 
limited to, the use of RNA:DNA vectors, RNA:DNA mutational vectors, EINA:DNA 
repair vectors, mixed-duplex oligonucleotides, self-complementary KNArDNA 
oligonucleotides, and recombinogenic oligonucleobases. Such vectors and methods 
5 of use are knov^n in the art. See, for example, U.S. Patent Nos. 5,565,350; 5,73 1,181; 

5.756,325; 5,760,012; 5.795,972; and 5,871,984; each of which are herein 
incorporated by reference. See also. WO 98/49350, WO 99/07865, WO 99/25821, 
and Beetham etal (1999) Proc. Natl Acad, ScL USA 96:8774-8778; each of which is 
herein incorporated by reference. 

1 0 Thus inhibition of expression of FucT and/or XylT in a higher plant of interest 

can be accomplished by any of the foregoing methods in order to alter the N- 
glycosylation pattern of endogenous and heterologous glycoproteins produced within 
that plant such that these glycoproteins comprise complex A^-glycans that have a 
reduction in the amount of P 1 ,2-linked xylose residues and/or al ,3-linked fucose 

1 5 residues. The extent to which attachment of the P 1 ,2-linked xylose residue and/or 

al,3-linked fucose residue to glycoprotein A^-glycans is reduced is governed by the 
degree of inhibition of expression of the respective XylT and FucT enzymes. 

In some embodiments of the invention, recombinant glycoproteins produced in 
a plant host that is stably transformed using the methods described herein to target 

20 XylT expression have //-linked glycans comprising less than 50%, less than 40%, less 

than 30% of the pl,2-linked xylose residues occurring in the respective A^-linked 
glycans of glycloproteins produced in a plant host that has not been genetically 
modified to inhibit expression of the XylT enzyme and isoforms thereof. In other 
embodiments, these recombinant glycoproteins have A^-linked glycans comprising less 

25 than 25%, less than 20%, less than 15%, less than 10%, less than 5%, or less than 1% 

of the pi,2-linked xylose residues occurring in the respective //-linked glycans of 
glycloproteins produced in a plant host that has not been genetically modified to 
inhibit expression of the XylT enzyme and isoforms thereof In yet other 
embodiments, the methods of the invention provide for complete silencing of the 

30 XylT gene and any isoforms thereof within the stably transformed plant, such that the 

recombinant glycoproteins produced within the plant have //-linked glycans that are 
devoid of pi,2-linked xylose residues. 
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In like manner, where a plant host has been stably transformed using the 
methods described herein to target FucT expression, recombinant glycoproteins 
produced within the plant have A/-linked glycans comprising less than 50%, less than 
40%, less than 30% of the al ,3-linked fiicose residues occurring in the respective 
5 linked glycans of glycoproteins produced in a plant host that has not been genetically 

modified to inhibit expression of the FucT enzyme and isoforms thereof. In other 
embodiments, these recombinant glycoproteins have AMinked glycans comprising less 
than 25%, less than 20%, less than 15%, less than 10%, less than 5%, or less than 1% 
of the a 1,3 -linked fucose residues occurring in the respective A^-linked glycans of 

10 glycoproteins produced in a plant host that has not been genetically modified to 

inhibit expression of the FucT enzyme and isoforms thereof. In yet other 
embodiments, the methods of the invention provide for complete silencing of the 
FucT gene and any isoforms thereof within the stably transformed plant, such that the 
recombinant glycoproteins produced within the plant have TV-linked glycans that are 

1 5 devoid of a 1 ,3-linked fucose residues. 

Where a plant host has been stably transformed using the methods described 
herein to target expression of both the XylT and FucT enzymes, and any isoforms 
thereof, recombinant glycoproteins produced within the plant have iV-linked glycans 
comprising less than 50%, less than 40%, less than 30% of the pi,2-linked xylose 

20 residues and less than 50%, less than 40%, less than 30% of the al,3-linked fucose 

residues occurring in the respective A^-linked glycans of glycoproteins produced in a 
plant host that has not been genetically modified to inhibit expression of the XylT and 
FucT enzymes and isoforms thereof. In other embodiments, these recombinant 
glycoproteins have //-linked glycans comprising less than 25%, less than 20%, less 

25 than 15%, less than 10%, less than 5%, or less than 1% of the pi,2-linked xylose 

residues and 25%, less than 20%, less than 15%, less than 10%, less than 5%, or less 
than 1% of the al,3-linked fucose residues occurring in the respective AMinked 
glycans of glycoproteins produced in a plant host that has not been genetically 
modified to inhibit expression of the XylT and FucT enzymes and isoforms thereof. 

30 ^ In yet other embodiments, the methods of the invention provide for complete 

silencing of the XylT and FucT gene and any isoforms thereof within the stably 
transformed plant, such that the recombinant glycoproteins produced within the plant 
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have A/^linked glycans that are devoid of pi,2-linked xylose residues and al,3-linked 
fucose residues. 

In some embodiments of the present invention, a plant host that has been 
stably transformed using the methods described herein to target expression of both the 
5 XylT and FucT enzymes, and any isoforms thereof, is capable of producing 

recombinant glycoproteins wherein the A^-linked glycans are substantially 
homogenous. By "substantially homogenous" is intended that the glycosylation 
profile reflects the presence of a single major peak corresponding to a desired N- 
glycan species, more particularly, the GO glycan species, wherein at least 90% of the 

10 iV-glycan structures present on said glycoproteins are of the GO glycan species. 

Methods for monitoring changes in the A^-glycosylation pattern of 
glycoproteins, also referred to as glycosylation profiles, are well known in the art and 
include, but are not limited to, matrix-assisted laser desorption ionization time-of- 
flight (MALDl-TOF) mass spectrometry, for example, using the modified MALDI- 

1 5 TOP assay disclosed in Example 3 herein below, liquid chromatograph mass 

spectrometry (LC-MS), gas chromatography, anion-exchange chromatography, size- 
exclusion chromatography, high-concentration polyacrylaminde gel electrophoresis, 
nuclear magnetic resonance spectroscopy, and capillary electrophoresis and capillary 
gel electrophoresis, fluorescence labeling and detection by high-performance liquid 

20 chromatography (HPLC) and QTOF, and the like. In this manner, changes in the N- 

glycosylation pattern due to inhibition of the expression or function of XylT and/or 
FucT in a stably transformed plant of the invention can be monitored by subjecting a 
sample (for example, a leaf tissue sample) obtained from the stably transformed plant 
to total A^-glycan analysis by MALDI-TOF mass spectrometry, and comparing the 

25 results with those obtained for a comparable sample from a control plant, wherein the 

control plant has not been genetically modified to inhibit expression or function of 
XylT and/or FucT. A reduction in the amount of xylose and/or fiicose residues in the 
AZ-glycans can be monitored by a reduction of the mass of the respective peaks. See, 
for example, Strasser et al (2004) FEBS Letters 561 : 132-136. 

30 Similarly, the glycosylation profile of any given recombinantly produced 

glycoprotein is readily determined using standard techniques well known to those in 
the art. See, for example, the review provided in Morelle and Michalski (2005) Curr. 
Anal Chem. 1:29-57; herein incorporated by reference in its entirety. Thus, 
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glycoproteins that have been recombinantly produced in a host organism, including a 
plant host, can be analyzed for the ratio of the particular //-linked glycan structures 
attached thereto. In this manner, a sample comprising isolated recombinantly 
produced glycoprotein can be subjected to enzymatic or chemical reaction to release 

S the individual glycan structures from the glycoprotein. Following this 

deglycosylation step, analysis of the glycosylation profile can be carried out using any 
of the analytical assays described herein above. 

Recombinantly produced glycoprotein products typically exist as a diverse 
population of glycoforms carrying between one and several dozen different glycans in 

10 variable molar amounts at glycosylation sites with varying degrees of site occupancy. 

Depending upon the glycoprotein, different glycoforms can yield different functional 
profiles. Thus, in some embodiments of the invention, it is desirable to determine the 
glycosylation profile of the glycoprotein having the A^glycans intact. Any technique 
known in the art for determining the glycosylation profile of an intact glycoprotein 

15 can be used, including the mass spectrometry methods noted above and in the 

examples herein below. 

By reducing or eliminating the expression or function of fucosyltransferase 
and/or xylosyltransferase in the manner set forth herein, either transiently or stably, it 
is possible to produce a transgenic higher plant having the ability to produce 

20 glycoproteins having an iV-glycosylation profile with reduced heterogeneity relative to 

that normally observed for glycoproteins produced by this plant when expression or 
function of these enzymes has not been altered (i.e., the plant has the native or wild- 
type glycosylation machinery). Where expression or function of one or both of these 
enzymes is stably reduced or eliminated using one or more of the methods described 

25 herein above, the reduction in the heterogeneity of the A^-glycosylation profile of 

glycoproteins produced by the transgenic plant can be maintained from plant 
generation to plant generation, including with asexual or sexual reproduction, and can 
be maintained across cultural conditions and with scale-up in production. 

In this manner, the present invention provides a method for reducing 

30 heterogeneity of the A^-glycosylation profile of a glycoprotein produced in a higher 

plant, for example, a dicotyledonous or monocotyledonous plant, for example, a 
duckweed plant. The method comprises introducing into the plant a nucleotide 
construct described herein such that the expression or function of fucosyltransferase 
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and/or xylosyltransferase is reduced or eliminated within the plant. In some 
embodiments of the invention, the method for reducing heterogeneity of the N- 
glycosylation profile of a glycoprotein produced in a higher plant comprises 
introducing into the higher plant of interest at least one nucleotide construct described 
5 herein above, where the nucleotide construct(s) provides for suppression of the 

expression of fucosyltransferase and/or xylosyltransferase in the plant, for example, 
using one or more of the methods described herein above. 

By "reducing heterogeneity of the A^-glycosylation profile" it is intended that 
the 7V-glycosylation profile is characterized by a reduction in the total number of 

10 distinct A^-glycan species that appear in the profile. Thus, for example, where a 

glycoprotein produced by a higher plant having the native or wild-type glycosylation 
machinery (and thus which has not been genetically modified to reduce or eliminate 
expression of fucosyltransferase and xylosyltransferase) produces a glycoprotein with 
an N-glycosylation profile characterized by the presence of a mixture of 5 A^-glycan 

1 5 species, the methods of the invention can be used to reduce the number of iV^glycan 

species appearing in the ^-glycosylation profile. In this manner, when that higher 
plant is genetically modified in the manner set forth herein to reduce or eliminate 
expression or function of fucosyltransferase and/or xylosyltransferase, the A^- 
glycosylation profile of this glycoprotein would be characterized by a reduction in the 

20 number of A^-glycan species appearing in the profile, for example, a mixture of fewer 

than 5 A^-glycan species, for example, 4, 3, or 2 //-glycan species, or even a single N- 
glycan species. Where heterogeneity of the //-glycosylation profile is reduced such 
that the profile is characterized by the presence of a single predominant //-glycan 
species, the //-glycosylation profile would be substantially homogeneous for that N- 

25 glycan species. 

In some embodiments, the methods for reducing the heterogeneity of the N- 
glycosylation profile of a glycoprotein produced in a higher plant result in the 
produced glycoprotein having an iV-glycosylation profile that is substantially 
homogeneous for the GO glycan species. In such embodiments, the methods for 

30 reducing the heterogeneity of the A^-glycosylation profile of a glycoprotein produced 

in a higher plant result in the produced glycoprotein having a substantially 
homogeneous A^-glycosylation profile, wherein at least 80%, at least 85%, at least 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the total 
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amount of A^-glycan species appearing in the A^-glycosylation profile for the 
glycoprotein is represented by the GO glycan species. In these embodiments, a trace 
amount of precursor W-glycan species may appear in the A/-glycosylation profile as 
noted elsewhere herein, where any given precursor A^glycan species that is present in 
5 the iV-glycosylation profile is present at less than 5%, preferably less than 4%, less 

than 3%, less than 2%, less than 1%, and even less than 0.5% or even less than 0. 1% 
of the total amount of AT-glycan species appearing in the profile. 

The glycoprotein can be an endogenous glycoprotein of interest, or can be a 
heterologous glycoprotein that is produced by the higher plant of interest, for 

10 example, a mammalian glycoprotein, including, for example, the glycoproteins 

described elsewhere herein. In some embodiments, the glycoprotein is a monoclonal 
antibody. In other embodiments, the glycoprotein is selected from the group 
consisting of an interferon, erythropoietin (EPO), tissue plasminogen activator (tPA), 
plasminogen, granulocyte-macrophage colony stimulating factor (GM-CSF), and 

1 5 therapeutic immunoglobulins. 

Using the methods of the present invention, it is possible to maintain the 
reduced heterogeneity within the A^glycosylation profile for a glycoprotein produced 
in the transgenic plant with scale-up in production, and thus the plant continues to 
produce the glycoprotein such that its yV-glycosylation profile is characterized by a 

20 reduction in the number of A/^glycan species appearing in the profile. By "scale-up in 

production" or "increase in production scale" is intended an increase in the amount of 
plant biomass that is present within a culture system (i.e., a culture vessel or culture 
container within which the plant is cultured) that is being used to produce a protein of 
interest, in this case, a glycoprotein of interest. Thus, scale-up in production occurs, 

25 for example, when scaling up production from a scale that is suitable for research 

purposes to one that is suitable for pilot production, and further up to a scale that is 
suitable for commercial production of the glycoprotein of interest. 

In some embodiments, the transgenic higher plant is a monocotyledonous 
plant, for example, a duckweed plant, that serves as a host for recombinant production 

30 of a glycoprotein, and the reduced heterogeneity of the A^-glycosylation profile of the 

recombinantly produced glycoprotein is maintained with an increase in production 
scale, where the production scale is increased by at least 300-fold, at least 500-fold, at 
least 700-fold, at least 1,000- fold, at least 1,500-fold or greater over the initial starting 
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biomass. In some of these embodiments, the transgenic higher plant is a duckweed 
plant that recombinantly produces a glycoprotein of interest, and the reduced 
heterogeneity of the //-glycosylation profile is maintained with an increase in 
production scale, where the production scale is increased by at least 2,000-fold, at 
5 least 3,000-fold, at least 4,000-fold, at least 5,000-fold, at least 6,000-fold, at least 

6,500-fold, or greater over the initial starting biomass. In one such embodiment, the 
higher plant is a duckweed plant that recombinantly produces a glycoprotein of 
interest, and the reduced heterogeneity of the A/'-glycosylation profile for that 
glycoprotein is maintained with an increase in production scale, where the production 

10 scale is increased by at least 7,000-fold, 8,000-fold, 9,000-fold, 10,000-fold, 12,500- 

fold, 15,000-fold, 17,500-fold, 20,000-fold, 23,000-fold, 26,000-fold, or greater over 
the initial starting biomass. 

Furthermore, when the transgenic plant of interest is to be maintained by 
continuous clonal culture, the resulting transgenic line continues to produce 

1 5 glycoproteins that exhibit the reduced heterogeneity within their A^-glycosylation 

profile. Continuous clonal culture can be achieved using any suitable method known 
in the art. In some embodiments, continuous clonal culture is achieved by 
periodically taking one or more subsamples of the plant culture and transferring the 
subsample(s) to fresh culture medium for further culture. Thus, for example, in some 

20 embodiments, the transgenic plant line that is maintained by continuous clonal culture 

is a duckweed transgenic plant line that has been genetically modified to reduce or 
eliminate expression or fijnction of fijcosyltransferase and/or xylosyltransferase. In 
this manner, the reduced heterogeneity of the iV-glycosylation profile of glycoproteins 
produced in the transgenic plant line is maintained with continuous clonal culture of 

25 the transgenic plant line for at least 8 months, at least 10 months, at least 1 year, at 

least 1.5 years, at least 2 years, at least 2.5 years, at least 3 years, at least 3.5 years, at 
least 4 years, at least 4.5 years, at least 5 years, or longer, and can be maintained for 
as long as the transgenic plant line is maintained. 



30 Heterologous Polvpeotides and Glvcooroteins 

Higher plants, particularly higher plants that serve as expression systems for 
recombinant proteins for pharmaceutical use, that have been stably transformed to 
produce glycoproteins with an altered AT-glycoslyation pattern using the methods 
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described herein may be genetically modified to produce any recombinant protein of 
interest. Where the recombinant protein is one in which post-translational 
glycosylation is applicable, the methods of the invention advantageously provide a 
means to produce these glycoproteins with an A^-glycosylation pattern that more 

5 closely reflects that of mammalian hosts, particularly a glycosylation pattern that is 

"humanized." Examples of recombinant proteins of interest include, but are not 
limited to, insulin, growth hormone, plasminogen, a-interferon, (i-interferon, P- 
glucocerebrosidase, P-glucoronidase, retinoblastoma protein, p53 protein, angiostatin, 
leptin, monoclonal antibodies and fragments thereof, cytokines, for example, 

10 erythropoietin (EPO), granulocyte macrophage colony stimulating factor, tissue 

plaminogen activator, blood coagulation factors, for example. Factor Vn, Factor VIQ, 
Factor IX, and activated protein C, alpha 1 -antitrypsin, receptors, hormones, human 
vaccines, animal vaccines, peptides, and serum albumin. Furthermore, the transgenic 
higher plants of the invention are capable of producing a glycoprotein product that has 

15 a substantially homogenous glycosylation profile for the GO glycan species, and 

which is characterized by its substantial hoomogeniety for the GO glycoform. This 
advantageously results in plant host expression systems that have increased 
production consistency, as well as reduced chemical, manufacturing, and control 
(CMC) risk associated v^th the production of these glycoprotein compositions. 

20 The glycoprotein compositions produced in accordance with the methods of 

the present invention may be contained in a composition comprising a 
pharmaceutically acceptable carrier. Such compositions are useful in a method of 
treating a subject for a disease or disorder for which treatment with the glycoprotein 
will provide a therapeutic benefit. In this manner, glycoproteins that have been 

25 produced in a plant, for example, a duckweed plant, stably transformed in accordance 

with the methods of the present invention can be administered to a subject in need 
thereof 

The glycoprotein compositions of the invention comprise TV-linked glycans 
that are predominately of the GO glycan structure. In this manner, the present 
30 invention provides glycoprotein compositions that have glycosylation profiles that are 

"substantially homogeneous" or "substantially uniform" or have "substantial 
homogeneity" as defined herein above. Thus, in some embodiments, the glycoprotein 
compositions are substantially homogeneous for the GO glycan species, and thus have 
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a substantially homogeneous glycosylation profile, wherein at least 80%, at least 85%, 
at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the total 
amount of A^-glycan species appearing in the glycosylation profile for the composition 
is represented by the GO glycan species, with a trace amount of precursor A/^glycan 
5 species appearing in the glycosylation profile, i.e., any given precursor A^-glycan 

species that is present in the glycosylation profile is present at less than 5%, 
preferably less than 4%, less than 3%, less than 2%, less than 1%, and even less than 
0.5% or even less than 0.1% of the total amount of /V-glycan species appearing in the 
profile. For such a composition, a representative precursor A^-glycan species 

10 appearing in its glycosylation profile would be the Man3GlcNAc2, MGn 

(GlcNaclMan3GlcNAc2 wherein GlcNacl is attached to the 1,3 mannose arm), and 
GnM (GlcNacl Man3GlcNAc2 wherein GlcNacl is attached to the 1,6 mannose arm) 
precursor A^-glycan species described above, where any single one or any combination 
of these precursor A^-glycan species can be present. 

15 In this manner, the invention provides "substantially homogeneous" or 

"substantially uniform" glycoprotein compositions or glycoprotein compositions 
having "substantial homogeneity" as defined herein above. In some embodiments, 
the invention provides substantially homogeneous glycoprotein compositions, 
wherein at least 80%, at least 85%, at least 90%, 91%. 92%, 93%, 94%, 95%, 96%, 

20 97%, 98%, or at least 99% of the glycoprotein present in the composition is 

represented by the GO glycoform, wherein all anticipated glycosylation sites are 
occupied by the GO glycan species, with a trace amount of precursor or undesired 
glycoforms being present in the composition, i.e., the precursor glycoforms represent 
less than 5%, less than 4%, less than 3%, less than 2%, less than 1%. or even less than 

25 0.5%, or less than 0. 1 % of the total glycoforms present within the composition. In 

such a composition, a representative precursor glycoform would be one in which 
glycosylation sites are unoccupied, and an exemplary undesired glycoform would be a 
glycoform having a mixture of GO glycan and GOX or G0XF3 glycan species attached 
to its glycosylation sites. 

30 In some embodiments of the invention, the plant host comprises one or more 

polynucleotides that provide for expression of an antibody that specifically binds to a 
mammalian protein of interest, particularly a human protein of interest. Thus, in one 
aspect, the invention provides methods for producing monoclonal antibodies in higher 
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plants, wherein the monoclonal antibodies have an A/-glycosylation pattern that 
reflects a reduction in the amount of pi,2-linked xylose residues and al,3-linked 
fticose residues within the A^-linked glycans, and compositions comprising 
recombinant monoclonal antibodies produced using plant hosts genetically modified 
5 in the manner set forth herein. In some embodiments, the plant host of interest is a 

member of the duckweed family. 

Monoclonal antibodies are increasingly being used as therapeutic agents to 
treat human disease, including, but not limited to, cancer and diseases having an 
autoimmune or inflammatory component. See, for example. King (1999) Curr. Opin, 

10 Drug Discovery Dev, 2:110-17; Vaswani and Hamilton {\99%) Ann, Allergy Asthma 

Immunol 81:105-19; and Holliger and Hoogenboom, NaU Biotechnology 16:1015-16; 
each of which is herein incorporated by reference. Although some of these antibodies 
have therapeutic effects that result solely from antigen binding, for example 
antibodies that bind to a receptor or ligand to prevent ligand-receptor interactions, 

15 other antibodies need effector functions such as the recruitment of the immune system 

to kill target cells in order to be therapeutically active. See, for example, Clynes et al 
(2000) Nat Med, 6:443-46; Clynes et al (1998) Proc. Natl Acad. Sci. U.S.A. 95:652- 
56, and Anderson et al (1997) Biochem. Soc. Trans. 25:705-8; each of which is 
herein incorporated by reference. 

20 The antigen-recognition activities and effector functions of antibodies reside 

in different portions of the antibody molecule. The Fab' portion of the antibody 
provides antigen recognition activity, while the Fc portion provides effector functions 
such as the activation of accessory effector cells including phagocytic cells 
(macrophages and neutrophils), natural killer cells, and mast cells. Antibodies bind to 

25 cells via the Fc region, with an Fc receptor site on the antibody Fc region binding to 

an Fc receptor (FcR) on a cell. There are a number of Fc receptors that are specific 
for different classes of antibodies, including IgG (gamma receptors), IgE (eta 
receptors), IgA (alpha receptors) and IgM (mu receptors). Binding of antibody to Fc 
receptors on cell surfaces triggers a number of important and diverse biological 

30 responses including engulfment and destruction of antibody-coated particles, 

clearance of immune complexes, lysis of antibody-coated target cells by killer cells 
(called antibody-dependent cell-mediated cytotoxicity, or ADCC), initiation of 
complement-dependent cytotoxicity (CDC), release of inflammatory mediators, and 
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control of immunoglobulin production. Methods for assaying effector function of 
antibodies are well known in the art and include those assaying for CDC, ADCC, and 
apoptosis. See, for example, Subbramanian et ah (2002) 1 Clin, Microbiol 40:2141- 
2146; Ahman et al (1994) J. Immunol Methods 36:243-254; Brezicka et al (2000) 
5 Cancer Immunol Immunother, 49:235-242; Gazzano-Santoro et al (1 997) 7. 

Immunol Methods 202:163-171; Prang et al (2005) British J, Cancer 92:342-349; 
Shan et al. (1998) Blood 92:3756-377 1; Ghetie et al (2001) Blood 97: 1392-1398; 
and, Mathas et al (2000) Cancer Research 60:7170-7176; all of which are herein 
incorporated by reference. 

10 It is known in the art that the glycosylation status of the Fc portion of an 

antibody molecule plays a key role in determining whether an antibody will have 
effector function. See, for example, Tao and Morrison (1987) /. Immunol 143:2595- 
601; Wright and Morrison (1997) Trends in Biotech, 15:26-32; Wright and Morrison 
(1998) y. Immunol 160:3393-402; Mimurae/a/. (2000) M?/. Immunol 37:697-706; 

15 Jefferis and Lund (2002) Immunol Lett, 82:57-65; Krapp et al (2003) J, Mol Biol 

325:979-89; and Jefferis (2005) Biotechnol Prog, 21:11-16; each of which is herein 
incorporated by reference. Glycosylation of recombinantly produced antibodies 
varies depending on the expression system used. See, for example, Raju et al (2000) 
Glycobiology 10:477-86; Wright and Morrison (1997) Trends in Biotech, 15:26-32. 

20 Further, where the //-glycosylation pattern of a mammalian-produced monoclonal 

antibody is altered to reduce or deplete the a(l ,6)-linked core fucose residue, the 
monoclonal antibody exhibits increased effector function, particularly increased 
ADCC activity. See, for example, U.S. Patent No. 6,946,292. For some mammalian- 
produced monoclonal antibodies, where the A/-glycosylation pattern is altered to 

25 reduce or deplete the P(l,4)-galactose residues attached to the 1,3 and/or 1,6 mannose 

arms, activation of complement-dependent cytotoxicity (CDC) against antigen- 
bearing target cells may be reduced Mdthout altering other ftinctional activities of the 
antibody, including ADCC activity. See, for example, Boyd et al (1995) Mol 
Immunol 32:1311-1318. 

30 Antibodies having antigen recognition activity, and in some embodiments 

improved effector function, may be produced by a higher plant host, such as 
duckweed, that has been stably transformed in the manner set forth herein to alter its 
glycosylation machinery. Accordingly, the present invention provides methods for 
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producing a recombinant monoclonal antibody, including a monoclonal antibody 
having improved effector function, wherein the antibody is recombinantly produced 
within a plant having an altered A^glycosylation pattern of endogenous and 
heterologous gylcoproteins produced therein such that these glycoproteins exhibit a 
5 reduction in the amount of the plant-specific pl,2-linked xylose residues and/or al,3- 

linked fucose residues attached to the A/-glycans thereof Where the antibodies have 
reduced amounts al,3-linked fucose residues attached to the 7V-glycans thereof, the 
antibodies may have increased ADCC activity relative to antibodies produced in a 
control plant that has not been genetically modified to inhibit expression or function 

10 ofFucT. 

Also encompassed are recombinant monoclonal antibodies having effector 
function, and in some embodiments, improved effector function, where the antibodies 
are produced in a duckweed expression system that has been genetically modified to 
inhibit expression or function of the FucT of SEQ ID N0:3 and/or the XylT of SEQ 

1 5 ID N0:6, and any isoforms thereof, for example, the XylT isoform #2 comprising the 

sequence set forth in SEQ ID N0:2 1 (encoded by SEQ ID NO:20). Thus, in some 
embodiments, the plant serving as the host for recombinant production of the 
monoclonal antibody is a member of the Lemnaceae as noted elsewhere herein, for 

^ example, a Lemna plant, comprising, for example, a XylT RNAi expression cassette 

20 and/or a FucT RNAi expression cassette described above stably integrated within its 

genome. In this manner, the present invention provides a method for producing a 
recombinant monoclonal antibody having an A^-glycosylation pattern that more 
closely resembles that found in a mammalian host expression system, and with 
improved effector function, where the method comprises expressing one or more 

25 chains of the antibody in a duckweed plant, or duckweed cell or duckweed nodule, 

that has been genetically modified to alter the glycosylation machinery such that the 
recombinantly produced monoclonal antibody exhibits a reduction in the attachment 
of the plant pi,2-linked xylose residue and/or al,3-linked fucose residue to the N- 
glycans thereof, and culturing the duckweed plant, or duckweed cell or duckweed 

30 nodule, under conditions suitable for expression of the monoclonal antibody. 

Thus the present invention provides novel antibody compositions wherein the 
antibody comprises A^-linked glycans that are predominately of the GO glycan 
structure. In this manner, the present invention provides antibody compositions, for 
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example, monoclonal antibody compositions, that have glycosylation profiles that are 
"substantially homogeneous" or "substantially uniform" or have "substantial 
homogeneity" as defined herein above. Thus, in some embodiments, the antibody 
compositions are substantially homogeneous for the GO glycan species, and thus have 
S a substantially homogeneous glycosylation profile, wherein at least 80%, at least 85%, 

at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the total 
amount of A^-glycan species appearing in the glycosylation profile for the composition 
is represented by the GO glycan species, with a trace amount of precursor /V-glycan 
species appearing in the glycosylation profile, i.e., any given precursor //-glycan 

10 species that is present in the glycosylation profile is present at less than 5%, 

preferably less than 4%, less than 3%, less than 2%, less than 1%, and even less than 
0.5% or even less than 0.1% of the total amount of iV-glycan species appearing in the 
profile. For such a composition, a representative precursor //-glycan species 
appearing in its glycosylation profile would be, for example, the Man3GlcNAc2, 

15 MGn (GlcNaclMan3GlcNAc2 wherein GlcNacl is attached to the 1,3 mannose arm), 

and GnM (GlcNaclMan3GlcNAc2 wherein GlcNacl is attached to the 1,6 mannose 
arm) precursor //-glycan species described above, where any single one or any 
combination of these precursor A^-glycan species can be present. 

In one such embodiment, the monoclonal antibody composition has a 

20 substantially homogeneous glycosylation profile, wherein 95.8% of the total amount 

of //-glycan species appearing in the glycosylation profile for the composition is 
represented by the GO glycan species (GlcNAciMansGlcNAca), with the following 
precursor //-glycan species appearing in the glycosylation profile: Man3GlcNAc2 
(0.67%), GlcNAcManaGlcNAcz (1.6%), GalGlcNAcaMansGlcNAci (K2%), 

25 MangGlcNAcz (0.2 1 %), Man7GlcNAc2 (0.30%), and MangGlcNAci (0.28%). This 

can be compared with the monoclonal antibody composition obtained from the 'Svild- 
type" duckweed plant expression system wherein the same monoclonal antibody is 
expressed but where the glycosylation machinery of the duckweed plant has not been 
genetically modified to inhibit expression of XylT and FucT. Such a "wild-type "- 

30 derived monoclonal antibody composition has a more heterogeneous glycosylation 

profile that is characterized by two predominant //-glycan species, i.e., GOXF^ and 
GOX, with several precursor A^-glycan species represented in trace amounts. In one 
such embodiment, the "wild-type"-derived monoclonal antibody composition has a 
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glycosylation profile with the following iV-glycan species represented therein: GO 
(GlcNAc2Man3GlcNAc2) (8.4%); GOX (GlcNAc2[Xyl]Man3GlcNAc2) (17,2%); 
GOXF^ (GlcNAc2[Xyl]Man3[Fuc]GlcNAc2) (67.4%); MansGlcNAca (0.26%); 
GlcNAcManjGlcNAcz (0.40%); p(yl)Man3(Fuc)GlcNAc2 (0,76%); 
5 GlcNAc2Man3(Fuc)GlcNAc2 (2. 1%); GlcNAc(Xyl)Man3(Fuc)GlcNAc2 (1.4%); 

Man6GlcNAc2 (0.21%); Man7GlcNAc2 (0.63%); 

Gal(Fuc)GlcNAc2(Xyl)Man3(Fuc)GlcNAc2 (0.26%); MangGlcNAcz (0.61%); and 
Man9GlcNAc2 (0.40%). 

In this manner, the invention provides "substantially homogeneous" or 

10 "substantially uniform" antibody compositions or antibody compositions having 

"substantial homogeneity" as defined herein above. In some embodiments, the 
invention provides substantially homogeneous antibody compositions, for example, 
monoclonal antibody compositions, wherein at least 80%, at least 85%, at least 90%, 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% of the antibody present 

15 in the composition is represented by the GO glycoform, wherein all anticipated 

glycosylation sites (for example, each of the Asn-297 residues of the Ch2 domains of 
the heavy chains of an IgG-type antibody) are occupied by the GO glycan species, 
with a trace amount of precursor glycoforms being present in the composition. In one 
such composition, the precursor glycoforms are selected from the group consisting of 

20 an antibody having an Fc region wherein the Ch2 domain of one heavy chain has a GO 

glycan species attached to Asn 297, and the Ch2 domain of the other heavy chain is 
unglycosylated; an antibody having an Fc region wherein the Ch2 domain of one 
heavy chain has a GO glycan species attached to Asn 297, and the Ch2 domain of the 
other heavy chain has the GnM or MGn precursor glycan attached to Asn 297; and an 

25 antibody having an Fc region wherein the Asn 297 glycosylation site on each of the 

Ch2 domains has a GO glycan species attached, with a third GO glycan species 
attached to an additional glycosylation site within the mAb structure; wherein a trace 
amount of these precursor glycoforms is present, i.e., the precursor glycoforms 
represent less than 5%, less than 4%, less than 3%, less than 2%, less than 1%, or 

30 even less than 0.5%, or less than 0. 1% of the total glycoforms present within the 

antibody composition. 

The substantially homogeneous antibody compositions of the invention 
wherein at least 80%, at least 85%, at least 90%, 91%, 92%, 93%, 94%, 95%. 96%, 
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97%, 98%, or at least 99% of the antibody present in the composition is represented 
by the GO glycoform represent "glycan-optimized antibodies." By "glycan-optimized 
antibodies" is intended the antibodies of the invention have been genetically 
engineered in their glycosylation pattern such that they are substantially homogeneous 
5 for the GO glycoform, which yields an antibody having improved Fc effector function. 

By "improved Fc effector function" is intended these antibodies have increased 
ADCC activity relative to same-sequence antibodies (i.e., antibodies that have the 
same amino acid sequence) that, as a result of the production process, have a more 
heterogeneous glycosylation profile. Thus, for example, antibodies produced in 

10 manfunalian host expression systems, for example CHO cells, in insect host.cells, in 

yeast cells, or in other plant host expression systems that have not been genetically 
altered to inhibit XylT and FucT expression tend to have more heterogeneous 
glycosylation profiles, and thus a mixture of glycoforms, that can effect overall 
effector function of the antibody product. The GO glycoform of the antibody 

15 compositions of the present invention advantageously provides an antibody 

composition that has increased ADCC activity in association with the absence of 
fucose residues. In some embodiments, ADCC activity is increased by 25-fold, 50- 
fold, 75-fold, 100-fold, 150-fold, 200-fold. 250-fold, 300-fold, 400-fold, 500-fold, or 
even 1000-fold relative to same-sequence antibodies having a heterogeneous 

20 glycosylation profile (i.e., with multiple glycoforms present as major glycoforms in 

the antibody composition). Furthermore, the GO glycoform lacks the terminal Gal 
residues present in antibodies having the G2 glycoform. As such, these substantially 
homogeneous antibody compositions of the invention having predominately the GO 
glycoform have increased ADCC/CDC ratios. In addition, the substantially 

25 homogeneous antibody compositions of the invention having predominately the GO 

glycoform have increased binding to the FcyRIII, for example, FcyRIUa, wherein 
binding affinity is increased about 20-fold, 30-fold, 40-fold, 50-fold, 75-fold, up to 
100- fold over that observed for same-sequence antibody compositions having a 
heterogeneous glycosylation profile, and thus a mixture of glycoforms. For oncology 

30 and autoimmune diseases, therapeutic antibodies having increased binding affinity for 

Fc receptors, for example, FcyRlIl, has been strongly correlated with increased 
efficacy and improved response to treatment. 
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In some embodiments of the invention, the substantially homogeneous 
antibody compositions of the invention having predominately the GO glycoform have 
altered CDC activity when compared to that observed for same-sequence antibody 
compositions having a heterogeneous glycosylation profile, and thus a mixture of 
5 glycoforms. For example, in one such embodiment, the substantially homogeneous 

antibody compositions of the invention have predominately the GO glycoform and 
have decreased CDC activity when compared to that observed for same-sequence 
antibody compositions having a heterogeneous glycosylation profile, and thus a 
mixture of glycoforms. Thus, in some embodiments, the present invention provides 

10 substantially homogeneous antibody compositions having predominately the GO 

glycoform and CDC activity that is decreased by as much as 5%, 10%, 15%, 20%, 
25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 
95%, or even 100% when compared to same-sequence antibody compositions having 
a heterogeneous glycosylation profile. In some of these embodiments, the 

15 substantially homogeneous antibody compositions having predominately the GO 

glycoform and decreased CDC activity are further characterized by having increased 
ADCC activity when compared to same-sequence antibody compositions having a 
heterogeneous glycosylation profile. 

Without being bound by any theory or mechanism of action, a substantially 

20 homogeneous GO glycoform antibody composition of the present invention having 

decreased CDC activity, and similar or increased ADCC activity in the manner 
described above, may advantageously provide for increased cytotoxicity against target 
cells while reducing potential adverse side effects that may be associated with 
complement activation following its administration. By reducing the potential for 

25 these adverse side effects, the substantially homogeneous antibody compositions 

having predominately the GO glycoform can advantageously be administered at faster 
infusion rates, thereby reducing dosing time at any given administration, and/or can 
be dosed at higher initial concentrations if warranted, with reduced concern for 
triggering adverse side affects associated with complement activation. 

30 For example, complement activation plays a pivotal role in the pathogenesis of 

moderate to severe first-dose side effects of treatment with the chimeric anti-CD20 
monoclonal antibody IDEC-C2B8 (IDEC Pharmaceuticals Corp., San Diego, 
California; commercially available under the tradename Rituxan®, also referred to as 
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rituximab). See, for example, van der Kolk et al (2001) British J. Haematol 
1 15:807-81 1. The rituximab antibody within the Rituxan® product is expressed 
within Chinese hamster ovary (CHO) cells, and thus the antibody composition 
comprises a heterogeneous glycosylation profile (i.e., a mixture of glycoforms). CDC 

5 activity of rituximab has been shown to be correlated v^th galactose content. In this 

manner, as the number of galactose residues increases from 0-2 moles/mole of heavy 
chain, the level of CDC activity increases from 80% (P-galactosidase treated to 
remove all |3(l,4)-galactose residues from the 1,3 and 1,6 mannose arms of the N- 
glycans attached to Asn 297 of the Ch2 domains of the heavy chains) to 150% (UDP 

10 galactosyl transferase treated to ensure P(l ,4)-galactose residues are attached to both 

the 1,3 and 1,6 mannose arms of the iV-glycans attached to Asn 297 sites) of the 
maxiumum observed for the antibody having 1 mole galactose/mole of heavy chain 
(see, IDEC BLA 97-0260 at the website 

fda.gov/Cder/biologics/review/ritugenl 12697, available on the worldwide web). A 

15 substantially homogeneous anti-CD20 antibody composition comprising anti-CD20 

antibody having the same sequence as rituximab and having predominately the GO 
glycoform would advantageously have decreased CDC activity, thereby reducing the 
potential for adverse side effects normally associated with complement activation 
upon antibody administration when the antibody composition comprises a 

20 heterogenous glycosylation profile (i.e.. a mixture of glycoforms). 

Thus, a substantially homogeneous GO glycoform antibody composition of the 
present invention having decreased CDC activity, and the same or increased ADCC 
activity, can advantageously be used in therapeutic applications that have heretofore 
been unsuitable, inadvisable, or inefficacious for one or more patient populations as a 

25 result of complications due to adverse side effects normally associated with 

complement activation upon administration of the same-sequence antibody 
composition that comprises a heterogeneous glycosylation profile (i.e., a mixture of 
glycoforms). Such side effects include, but are not limited to, moderate to severe side 
effects that can be associated with first-time and/or rapid administration of an 

30 antibody, including, for example, fever and/or chills, nausea, dyspnea, flushes, and the 

like. See, for example, van der Kolk et al, (2001) British J. Haematol 11 5:807-81 1 
and the references cited therein; Winkler et al (1999) Blood 94:2217-2224). In this 
manner, the present invention provides a method for reducing one or more adverse 
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side effects related to complement activation upon administration of an antibody, for 
example, a monoclonal antibody, the method comprising administering the antibody 
as a substantially homogeneous antibody composition as defined herein above, and 
thus at least 80%. at least 85%, at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
5 98%, or at least 99% of the antibody present in the composition is represented by the 

GO glycoform, with a trace amount of precursor glycoforms being present in the 
composition. In some embodiments, at least 90%, at least 95%, or at least 99% of the 
antibody present in the composition is represented by the GO glycoform, with a trace 
amount of precursor glycoforms being present in the composition. 

10 As a result of their increased Fc effector function, the substantially 

homogenous antibody compositions of the invention having the predominately GO 
glycoform provide the opportunities for new and improved routes of administration, 
for example, extending the possible routes of administration for known therapeutic 
antibodies to other routes of administration beyond infusion and intravenous 

15 administration, for example, to subcutaneous administration. Furthermore, as a result 

of their increased potency, the antibody compositions of the invention can be dosed at 
lower concentrations, or dosed at lower volumes, and dosed with less frequency. A 
reduction in the volume of the administered antibody composition is particularly 
advantageous in those instances where adverse events resulting from infusion 

20 reactions with a monoclonal antibody are volume-related. The increased potency of 

the antibody compositions of the invention also opens up new clinical indications for 
existing mAb targets that may not have been responsive to antibody therapy with 
more heterogenous glycoform antibody compositions, and provides the ability to 
revisit previously undeveloped mAb targets. 

25 The monoclonal antibodies produced in accordance with the methods of the 

present invention may be contained in a composition comprising a pharmaceutically 
acceptable carrier. Such compositions are useful in a method of treating a subject in 
need of an antibody having effector function, and in some embodiments, improved 
effector function where FucT expression has been targeted for inhibition. In this 

30 manner, monoclonal antibodies produced in a plant, for example, a duckweed plant, 

stably transformed in accordance with the methods of the present invention can be 
administered to a subject in need thereof 



-82- 



wo 2007/084922 PCT/US200 7/060642 



Any antibody may be produced within a plant host genetically modified 
according to the methods of the invention. In one embodiment, the antibody is a 
therapeutic antibody. Antibodies have been approved for the treatment of a number 
of disorders, and many additional therapeutic antibodies are in development. See, 
5 Brekke and Sandlie (2003) NaL Rev. Drug, Discov. 2(3):240, herein incorporated by 

reference. Examples of therapeutic antibodies that may be expressed by the methods 
of the invention include, but are not limited to, anti-CD3 antibodies targeting the CD3 
antigen (g.g., 0KT®3); anti-platelet gpHb/IIIa antibodies (e.g. abciximab (previously 
known as c7E3 Fab), distributed as ReoPro®); anti-EpCAM antibodies (e.g., 

10 Panorex®); anti-CD20 antibodies targeting the CD20 antigen (eg. rituximab, 

distributed as Rituxan®; see U.S. Patent No. 5,736,137, herein incorporated by 
reference); anti IL-2 receptor (e.g. Zenapax®, Simulect®); anti-ERBB2 (e.g. 
trastuzumab, distributed as Herceptin®; see U.S. Patent No. 6,165,464, herein 
incorporated by reference); anti-TNF-a (e.g., infliximab, distributed as Remicade®), 

15 anti-F-protein (e.g. Synagis®); anti-CD30 antibodies targeting the CD30 antigen (see, 

for example, the 5F1 1 antibody described in Borchmann et al (2003) Blood 
102:3737-3742 and in Example 6 herein below, see also WO 03/059282 and U.S. 
Patent Application Publication No. 2004/0006215, herein incorporated by reference; 
the SGN-30 antibody described in Wahl et al. (2002) Cancer Res. 62:3736-3742, a 

20 chimeric version of the ACIO anti-CD30 antibody, see also U.S. Patent Application 

Publication No. 20040018194 and U.S. Patent No. 7,090,843, herein incorporated by 
reference); anti-CD33 antibodies targeting the CD33 antigen (e.g. Mylotarg®; see 
U.S. Patent No. 5.733,001, herein incorporated by reference); anti-CD52 antibodies 
targeting the CD52 antigen (e.g., alemtuzumab, distributed as Campath®; see U.S. 

25 Patent No. 5,846,534, herein incorporated by reference); anti-CD20 (e.g., Zevalin®); 

anti-IgE Fc (e.g., omalizumab, marketed under Xolair®); anti-VEGF (e.g., 
bevacizumab, distributed as Avastin®; see, Presta el al. (1997) Cancer Res. 57:4593- 
9; herein incorporated by reference); anti-EGF-R (e.g., cetuximab, distributed as 
Erbitux®)); anti-CDlla (e.g., efalizumab, distributed under Raptiva®), anti-IL-8; 

30 anti-C5; anti-TNF-a (e.g., adalimumab, distributed as Humira®); anti-TGF-P2; anti- 

IL2 receptor (e.g., daclizumab (Zenapax) and basiliximab (Simulect)); anti-a-4- 
integrin; anti-CD4; anti-CD2; anti-CD19 (e.g., MT103, a bispecific antibody); anti- 
CD22 antibody targeting the CD22 antigen (e.g., the monoclonal antibody BL-22); 
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anti-CD23 antibody targeting the CD23 antigen on malignant B cells (e.g., IDEC- 
152); anti-CD80 antibody targeting the CD80 antigen (e.g., IDEC-1 14); anti-CD38 
antibody targeting the CD38 antigen on malignant B cells; a-M-CSF antibody 
targeting macrophage colony stimulating factor, antibodies targeting the receptor 
5 activator of nuclear factor-kappaB (RANK) and its ligand (RANKL), which are 

overexpressed in multiple myeloma; anti-CD40 antibodies (e.g., SGN-40) targeting 
the CD40 antigen on malignant B cells; antibodies targeting tumor necrosis factor- 
related apoptosis-inducing ligand receptor 1 (TRAIL-Rl) (e.g., the agonistic human 
monoclonal antibody HGS-ETRl) and TRAIL-R2 expressed on a number of solid 
10 tumors and tumors of hematopoietic origin)anti-HBV antibodies; or anti-MHC class 11 

antibodies. Many of these antibodies are known to require effector activity for 
function, either CDC and/or ADCC, for example, Herceptin®, Rituxan®, Mylotarg®, 
and Campath®. 

In some embodiments, the plant host that has been genetically modified to 

15 alter its glycosylation machinery in the manner set forth herein, for example, 

duckweed, may express biologically active a-2b-interferon, for example human a-2b- 
interferon precursor (NCBI Protein Accession No. AAB59402) or mature human a- 
2b-interferon (amino acids 24-188 of NCBI Protein Accession No. AAB9402) or 
biologically active variants thereof. Examples of biologically active variants of 

20 human a-2b-interferon are known in the art. See, for example, WO 2005/035767, 

disclosing truncated versions of a-2b-interferon; European patent EP21 1 148B1; and 
U.S. Patent Nos. 4,748,233, 4,801,685, 4,816,566, 4,973,479, 4,975,276, 5,089,400, 
5,098,703, 5,23 1,176, and 5,869,293; herein incorporated by reference. In other 
embodiments, the plant host, for example, duckweed, may express biologically active 

25 mature human growth hormone with or without its accompanying signal peptide. The 

plant host may also express biologically active variants of human growth hormone. 
See, for example, WO 02/10414, directed to expression of biologically active 
polypeptides using a duckweed expression system. 

In yet other embodiments, the plant host, for example, duckweed, may express 

30 plasminogen, microplasminogen, or biologically active variants thereof. The 

plasminogen or microplasminogen to be expressed in the plant host may be from any 
mammalian source. In some embodiments, the plasminogen or microplasminogen is 
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human or porcine. See, for example, WO 2005/078109, directed to expression of 
these proteins using a duckweed expression system. 

By "biologically active variant" of these heterologous polypeptides is intended 
a polypeptide derived from the native polypeptide by deletion (so-called truncation) 
5 or addition of one or more amino acids to the N-terminal and/or C-terminal end of the 

native protein; deletion or addition of one or more amino acids at one or more sites in 
the native protein; or substitution of one or more amino acids at one or more sites in 
the native protein. Variants of a heterologous polypeptide of interest encompassed by 
the present invention retain the biological activity of the native polypeptides. 

10 For example, biologically active variants of a-2b-interferon continue to 

possess the desired biological activity of the native a-2b-interferon including the 
ability to increase resistance to viral infection, or the ability to modulate the 
transcription of a-2b-interferon-regulated gene targets. Such biologically active 
variants may result from, for example, genetic polymorphism or from human 

15 manipulation. Biologically active variants of a heterologous protein of interest will 

have at least about 50%, 60%, 65%, 70%, generally at least about 75%, 80%, 85%, 
preferably at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, and more 
preferably at least about 98%, 99% or more sequence identity to the amino acid 
sequence for the heterologous protein of interest. Thus, for example, a biologically 

20 active variant of native a-2b-interferon will have at least about 50%, 60%, 65%, 70%, 

generally at least about 75%, 80%, 85%, preferably at least about 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, and more preferably at least about 98%, 99% or more 
sequence identity to the amino acid sequence forhuman a-2b-interferon precursor 
(NCBI Protein Accession No. AAB59402) or mature human a-2b-interferon (amino 

25 acids 24-188 of NCBI Protein Accession No. AAB9402), where sequence identity is 

determined using the parameters identified herein above. Thus, a biologically active 
variant of a native heterologous protein of interest may differ from that protein by as 
few as 1-15 amino acid residues, as few as 1-10, such as 6-10, as few as 5, as few as 
4, 3, 2, or even 1 amino acid residue. 

30 In one embodiment, the plant host that has been genetically modified to alter 

its glycosylation machinery in the manner set forth herein is a stably transformed 
duckweed plant or duckweed cell or duckweed nodule that expresses biologically 

r 

active polypeptides that cannot effectively be conunercially produced by existing 

* 
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gene expression systems, because of cost or logistical constraints, or both. For 
example, some proteins cannot be expressed in mammalian systems because the 
protein interferes with cell viability, cell proliferation, cellular differentiation, or 
protein assembly in mammalian cells. Such proteins include, but are not limited to, 
S retinoblastoma protein, pS3, angiostatin, and leptin. The present invention can be 

advantageously employed to produce mammalian regulatory proteins; it is unlikely 
given the large evolutionary distance between higher plants and mammals that these 
proteins will interfere with regulatory processes in duckweed. Transgenic duckweed 
can also be used to produce large quantities of proteins such as serum albumin (in 

10 particular, human serum albumin), hemoglobin, and collagen, which challenge the 

production capabilities of existing expression systems. 

Finally, higher plant systems can be engineered to produce biologically active 
multimeric proteins (e.g., monoclonal antibodies, hemoglobin, P450 oxidase, and 
collagen, and the like) far more easily than can mammalian systems. One exemplary 

1 5 approach for producing biologically active multimeric proteins in duckweed uses an 

expression vector containing the genes encoding all of the polypeptide subunits. See, 
e.g., During et al, (1 990) Plant MoL Biol 15:281 and van Engelen et al (1994) Plant 
Mol. Biol. 26: 1701. The expression cassette comprising the XylT and/or FucT 
inhibitory polynucleotide can be introduced into such a vector. This vector is then 

20 introduced into duckweed cells using any known transformation method, such as a 

ballistic bombardment or Agrobacterium-mediBied transformation. This method 
results in clonal cell lines that express all of the polypeptides necessary to assemble 
the multimeric protein, as well as the XylT and/or FucT inhibitory sequences that alter 
the glyc'osylation pattern of the //-glycans of glycoproteins. Accordingly, in some 

25 embodiments, the transformed duckweed contains one or more expression vectors 

encoding a heavy and light chain of a monoclonal antibody or Fab* antibody fragment, 
and an expression cassette comprising the XylT and/or FucT inhibitory 
polynucleotide, and the monoclonal antibody or antibody fragment is assembled in the 
duckweed plant from the expressed heavy and light chain. 

30 A variation on this approach is to make single gene constructs, mix DNA from 

these constructs together, then deliver this mixture of DNAs into plant cells using 
ballistic bombardment or Agrobactenum'Xneilidit&i transformation. As a further 
variation, some or all of the vectors may encode more than one subunit of the 
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multimeric protein (i.e., so that there are fewer duckweed clones to be crossed than 
the number of subunits in the multimeric protein). In an alternative embodiment, each 
duckweed clone has been genetically modified to alter its glycosylation machinery 
and expresses at least one of the subunits of the multimeric protein, and duckweed 
5 clones secreting each subunit are cultured together and the multimeric protein is 

assembled in the media from the various secreted subunits. In some instances, it may 
be desirable to produce less than all of the subunits of a multimeric protein, or even a 
single protein subunit, in a transformed duckweed plant or duckweed nodule culture, 
e.g., for industrial or chemical processes or for diagnostic, therapeutic, or vaccination 

10 purposes. 

In some embodiments of the invention, the transgenic plant host of interest is a 
"high expressor" of a glycoprotein described herein, including, for example, the 
glycoproteins comprising iV-1 inked glycans that are predominately of the GO glycan 
structure. By "high expressor" is intended the transgenic plant host that has been 

15 engineered to produce the glycoproteins described herein is capable of producing the 

glycoprotein of interest at a level such that the glycoprotein of interest represents at 
least 5% or more of the total soluble protein produced in the transgenic plant host. In 
some embodiments, a "high expressor" is a transgenic plant host that has been 
engineered to produce the glycoproteins described herein such that the glycoprotein of 

20 interest represents at least 5%, 6%, 7%, 8%, 9%, 10%, 1 1%, 12%, 13%, 14%, 15%. or 

more of the total soluble protein produced in the transgenic plant host. Thus, for 
example, in one embodiment, the transgenic plant host is a duckweed that has been 
modified to inhibit expression of XylT and FucT, and the transgenic duckweed is a 
high expressor of a glycoprotein described herein. In some of these embodiments, the 

25 transgenic duckweed is a high expressor of a glycan-optimized monoclonal antibody 

having the predominate GO glycoform described herein above. In yet other 
embodiments, the transgenic duckweed expresses the glycan*optimized monoclonal 
antibody such that this glycoprotein represents about 5%, 6%, 7%, 8%, 9%, 10%, 
1 1%, 12%, 13%, 14%, 15%, or greater, of the total soluble protein. 



30 



Further Humanization of Glycoproteins 

In some embodiments, it may be desirable for the glycoproteins of the present 
invention to comprise complex A^-glycans having the terminal P(l,4)-galactose 
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residues attached to the 1,3 and/or 1,6 mannose arms. Without being bound by 
theory, these terminal galactose residues may contribute to the therapeutic function 
and/or pharmacokinetic activity of a glycoprotein. It is recognized that the methods 
of the present invention can be paired with other methods known in the art to further 
5 modify the glycoproteins of the invention such that one or more of the AT-glycans 

attached thereto comprises one or more terminal galactose residues, i.e., wherein one 
or more of the //-glycans is represented by the Gl or G2 glycan species. In this 
manner, the glycoprotein compositions of the present invention, including the 
monoclonal antibodies described herein, can be modified, for example, enzymatically 

1 0 with use of a glycosyl transferase enzyme to obtain glycoproteins having a 

substantially homogenous glycosylation profile for the Gl, preferably for the G2 
glycan species. See, for example, U.S. Patent Application Publication No. 
2004/0191256, herein incoroporated by reference in its entirety, teaching 
galactosyltransferase modification of a substrate glycoprotein to obtain a glycoprotein 

15 wherein substantially all of the TV-linked glycan species are of the G2 form. In this 

manner, the glycoprotein of interest can be reacted with an activated galactose in the 
presence of a galactosyltransferase and a metal salt. The galactosyltransferase can be 
a mammalian pi,4 galactosyltransferase (GalT), for example, human GalT, and the 
activated glactose can be, for example, UDP-galactose. 

20 Alternatively, the transgenic plants of the invention having FucT and XylT 

expression silenced in the manner set forth herein can be further modified in their 
glycosylation machinery such that they express a galactosyltransferase and efficiently 
attach the terminal galactose residue to the /V-glycans of endogenous and heterologous 
glycoproteins produced therein. In this manner, the transgenic plants of the invention 

25 can be further modified by introducing a nucleotide construct, for example, an 

expression cassette, that provides for the expression of a galactosyltransferase. The 
galactosyltransferase can be a mammalian pl,4 galactosyltransferase (GalT), for 
example, human GalT (see, for example, U.S. Patent No. 6,998,267, herein 
incorporated by reference in its entirety) or a hybrid GalT (see, for example, WO 

30 03/078637, herein incorporated by reference in its entirety) comprising at least a 

portion of a cytoplasmic tail-transmembrane-stem region of a first 
glycosyltransferase (e.g., a plant glycosyltransferase such as xylosyltransferase, N- 
acetylglycosaminlytransferase or fucosyltransferase) and at least a portion of a 
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catalytic region of a second glycosyltransefersae (e.g., manunalian glycosyltrasferase, 
for example, human GalT). By silencing expression of XylT and FucT in a plant, for 
example, a duckweed, and providing for expression of GalT, for example, human 
GalT, or a hybrid enzyme comprising a portion of the catalytic domain of GalT, for 

5 example, human GalT, in this plant, for example, duckweed, it is possible to obtain 

transgenic plants producing glycoproteins, both endogenous and heterologous, that 
have an altered glycosylation pattern, wherein the iV-linked glycans attached thereto 
have a reduction in the attachment of plant-specific xylose and plant-specific fucose 
residues and which comprise the terminal galactose residues (i.e., G2 glycan species). 

10 In this manner, glycoproteins that have a substantially homogeneous profile for the 

G2 glycan species, and/or which are substantially homogenous for the G2 glycoform 
can be obtained from transgenic plants of the invention. 

In other embodiments, it may be desirable to further modify the glycosylation 
pattern of the glycoproteins of the invention, wherein the A^-linked glycans attached 

IS thereto further comprise a terminal sialic acid residue attached to one or both of the 

galactose residues attached to the 1,3 and 1,6 mannose arms. The addition of the 
terminal sialic acid residue(s) may be required for the sustained stability, and in some 
cases function, of some therapeutic proteins. 

Depending upon the transgenic plant system, natural sialylation of 

20 glycoproteins may occur. Thus, there have been reports in the literature that cultured 

ArabidopsiSy tobacco, and Medicago cultured cells synthesize sialylated glycoproteins 
(Shah et ai (2003) NaL Biotech, 21(12):1470-1471; Joshi and Lopez (2005) Curr. 
Opin, Plant Biol 8(2):223-226). More recently, it was reported that Japanese rice 
express active sialyltransferase-like proteins (Takashima et al. (2006) J, Biochem. 

25 (Tokyo) 139(2):279-287). Hence, there are now orthogonal reports that plants have 

the machinery required to sialylate glycoproteins. 

Where further modification of the glycosylation pattern of the glycoproteins of 
the invention, wherein the A'-linked glycans attached thereto further comprise a 
terminal sialic acid residue attached to one or both of the galactose residues attached 

30 to the 1,3 and 1,6 mannose arms, is desired, the transgenic plants of the invention can 

be modified to express a P-1,4 galactosyltransferase, for example, human P-1,4 
galactosyltransferase, and to express or overexpress a sialyltransferase. Thus, for 
example, the transgenic plants can be further modified to express a sialyltransferase 
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such as a-2,3- and/or a-2,6-sialyltransferase. See, for example WO 2004/071 177; and 
Wee et al (1998) Plant Cell 10: 1759-1768; herein incorporated by reference in their 
entirety. Alternatively, the transgenic plants of the invention can be modified to 
express a P-1,4 galactosyltransferase, for example, human P-1,4 galactosyltransferase, 
S and to express any other enzymes that are deficient in the plant host's sialic acid 

pathway. The strategy(s) employed can be determined after an initial investigation of 
whether the particular plant host, for example, a duckweed, naturally expresses sialic 
acid-containing N-glycans on native or recombinantly produced glycoproteins. For 
example, if there is not evidence for the presence of the terminal sialic acid residues 

1 0 on N-glycans of glycoproteins produced within the transgenic plant host, particularly 

a transgenic plant host engineered to express a P-1,4 galactosyltransferase, then one or 
both of these strategies could be employed to achieve terminal sialylation of the N- 
glycans of glycoproteins produced within the transgenic plant host of interest. 
Alternatively, the glycoprotein compositions of the invention that are 

15 substantially homologous for G2 glycan species or the G2 glycoform can be modified 

by in vitro enzymatic processing; see, for example, U.S. Patent Application 
Publication No. 20030040037; herein incorporated by reference in its entirety. 

It is also recognized that for some glycoproteins produced in the transgenic 
plants of the invention, it may be desirable to have the mammalian a 1-6 fucose 

20 residue attached to the trimannose core structure (Man3GlcNAc2) of the A^-glycan 

species attached thereto. In such embodiments, the transgenic plants of the invention 
can be further genetically modified to express an a 1-6 fucosyltransferase, for 
example, human a 1-6 fucosyltransferase, using glycoengineering methods known in 
the art. 

25 It is recognized that the glycoprotein compositions of the invention can be 

produced by engineering any host cell of interest, including the plant host cells 
exemplified and described herein. In this manner, other protein expression host 
systems in addition to plant hosts, including animal, insect, bacterial cells and the like 
may be used to produce glycoprotein compositions according to the present invention. 

30 Such protein expression host systems may be engineered or selected to express a 

predominant glycoform or alternatively may naturally produce glycoproteins having 
predominant glycan structures. Examples of engineered protein expression host 
systems producing a glycoprotein having a predominant glycoform include gene 
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knockouts/mutations (Shields et al (2002) JBC 277:26733-26740); genetic 
engineering (Umana et al. (1999) Nature Biotech. 17: 176-180); or a combination of 
both. Alternatively, certain cells naturally express a predominant glycoform, for 
example, chickens, humans, and cows (Raju et al. (2000) Glycobiology 10:477-486). 
5 Thus, the expression of a glycoprotein, including an immunoglobulin such as a 

monoclonal antibody, or composition having predominantly one specific glycan 
^ structure according to the present invention can be obtained by one skilled in the art 
by selecting at least one of many expression host systems. Further expression host 
systems found in the art for production of glycoproteins include: CHO cells (see, for 

10 example, WO 9922764A1 and WO 03/035835A1); hybridroma cells (Trebak et aL 

(1999) J. Immunol Methods 230:59-70); insect cells (Hsu et al (1997) JBC 
272:9062-970). See also, WO 04/074499A2 regarding additional plant host systems. 

The glycoproteins produced in accordance with the methods of the present 
invention can be harvested from host cells in which they are recombinantly produced 

15 in order to obtain them in their isolated or purified form. In this maimer, the 

recombinantly produced glycoproteins of the invention are isolated from the host cells 
using any conventional means known in the art and purified, for example, by 
chromatography, electrophoresis, dialysis, solvent-solvent extraction, and the like. 
Thus, the present invention also provides for purified glycoproteins, including 

20 monoclonal antibody compositions, where the glycoproteins have substantially 

homogeneous glycosylation profiles, and are substantially homogeneous for the GO 
glycoform. These purified glycoproteins are substantially free of host cellular 
material, and include preparations of glycoprotein having less than about 30%. 20%. 
10%, 5%, or 1% (by dry weight) of contaminating protein, as noted herein above. In 

25 some embodiments, these purified glycoproteins can include at least 0.001%, 0.005%, 

0.1%, 0.5%, 1%, 1.5%, 2%, 2,5%, 3%, 3.5%, 4%, 4.5%, 5%, 5.5%, 6%, 6.5%, 7%, 
7.5%, 8%, 8.5%, 9%, 9.5%, 10%, 15%, 20%, 25%, or up to about 30% (by dry 
weight) of contaminating protein. Furthermore, for the recombinantly produced 
purified glycoproteins of the invention, optimally culture medium represents less than 

30 about 30%, 20%, 10%, 5%, or 1% (by dry weight) of chemical precursors or non- 

protein-of-interest chemicals within the purified glycoprotein preparation, as noted 
herein above. Thus, in some embodiments, culture medium components within these 
purified glycoproteins can represent at least 0.001%, 0.005%, 0. 1%, 0.5%, 1%. 1.5%, 
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2%, 2.5%, 3%, 3.5%, 4%, 4.5%, 5%, 5.5%. 6%, 6.5%. 7%. 7.5%, 8%, 8.5%. 9%. 
9.5%. 10%, 15%, 20%, 25%, or up to about 30% (by dry weight) of chemical 
precursors or non-protein-of-interest chemicals within the purified glycoprotein 
preparation. In some embodiments, isolation and purification results in recovery of 
5 purified glycoprotein that is free of contaminating host protein, free of culture 

medium components, and/or free of both contaminating host protein and culture 
medium components. 

Thus, in some embodiments, the protein expression host system is a plant, for 
example, a duckweed, and the purified glycoprotein obtained from the plant host is 

10 substantially free of plant cellular material, including embodiments where the 

preparations of glycoprotein have less than about 30%, 20%, 10%, 5%, or 1% (by dry 
weight) of contaminating plant protein. In other embodiments, the plant culture 
medium represents less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of 
chemical precursors or non-protein-of-interest chemicals within the purified 

15 glycoprotein. 

In some embodiments, these purified glycoproteins obtained from the plant 
host can include at least 0.001%, 0.005%, 0.1%, 0.5%. 1%. 1.5%, 2%, 2.5%, 3%, 
3.5%. 4%, 4.5%, 5%, 5.5%, 6%. 6.5%, 7%, 7.5%, 8%. 8.5%. 9%, 9.5%, 10%, 15%, 
20%, 25%, or up to about 30% (by dry weight) of contaminating plant protein. In 

20 other embodiments, plant culture medium components within in these purified 

glycoproteins can represent at least 0.001%. 0.005%, 0. 1%, 0.5%, 1%. 1 .5%. 2%. 
2.5%. 3%, 3.5%, 4%, 4.5%, 5%, 5.5%, 6%, 6.5%, 7%, 7.5%, 8%, 8.5%, 9%, 9.5%, 
10%, 15%, 20%, 25%, or up to about 30% (by dry weight) of chemical precursors or 
non-protein-of-interest chemicals within the purified glycoprotein. In some 

25 embodiments, isolation and purification from the plant host results in recovery of 

purified glycoprotein that is free of contaminating plant protein, free of plant culture 
medium components, and/or free of both contaminating plant protein and plant culture 
medium components. 

30 Expression Cassettes 

According to the present invention, stably transformed higher plants, for 
example, stably transformed duckweed, are obtained by transformation with a 
polynucleotide of interest contained within an expression cassette. Depending upon 
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the objective, the polynucleotide of interest can be one encoding a FucT or XylT 
polypeptide of interest, for example, encoding the polypeptide set forth in SEQ ID 
N0:3 (FucT) or SEQ ID NO:6 or 21 (XylT), or a variant thereof, thus providing for 
expression of these polypeptides in a cell, for example, a plant cell, or can be a FucT 
5 or XylT inhibitory polynucleotide that is capable of inhibiting expression or function 

of the FucT or XylT polypeptide when stably introduced into a cell, for example, a 
plant cell of interest. 

Thus, in some embodiments, the FucT and/or XylT polynucleotides of the 
invention, including those set forth in SEQ ID NOS: 1 and 2 (FucT) and SEQ ID 

10 N0S:4, 5, 19, and 20 (XylT) and fragments and variants thereof, are used to construct 

expression cassettes that comprise a FucT and/or XylT inhibitory polynucleotide as 
defined herein above. Stably introducing such an expression cassette into a plant or 
plant cell of interest can provide for inhibition of expression or function of the FucT 
and/or XylT polypeptides of the invention, including those set forth in SEQ ID N0:3 

1 5 (FucT) and SEQ ID N0:6 or 2 1 (XylT) and variants thereof, thereby altering the A^- 

glycan glycosylation pattern of endogenous and heterologous glycoproteins within a 
plant or plant cell stably transformed with the expression cassette. 

In some embodiments, the plant or plant cell that is stably transformed with an 
expression cassette comprising a FucT and/or XylT inhibitory polynucleotide has also 

20 been stably transformed with an expression cassette that provides for expression of a 

heterologous ix)lypeptide of interest, for example, a mammalian protein of interest, 
including the mammalian proteins of pharmaceutical interest as noted herein above. 
The expression cassette providing for expression of a heterologous polypeptide of 
interest can be provided on the same polynucleotide (for example, on the same 

25 transformation vector) for introduction into a plant, or on a different polynucleotide 

(for example, on different transformation vectors) for introduction into the plant or 
plant cell of interest at the same time or at different times, by the same or by different 
methods of introduction, for example, by the same or different transformation 
methods. 

30 The expression cassettes of the present invention comprise expression control 

elements that at least comprise a transcriptional initiation region (e.g., a promoter) 
linked to the polynucleotide of interest, i.e., a polynucleotide encoding a FucT or 
XylT polypeptide of the invention, a FucT and/or XylT inhibitory polynucleotide, or a 
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polynucleotide encoding a heterologous polypeptide of interest, for example, a 
mammalian protein. Such an expression cassette is provided with a plurality of 
restriction sites for insertion of the polynucleotide or polynucleotides of interest (e.g., 
one polynucleotide of interest, two polynucleotides of interest, etc.) to be under the 
S transcriptional regulation of the promoter and other expression control elements. In 

particular embodiments of the invention, the polynucleotide to be transferred contains 
two or more expression cassettes, each of which encodes at least one polynucleotide 
of interest. 

By "expression control element" is intended a regulatory region of DNA, 

10 usually comprising a TATA box, capable of directing RNA polymerase II, or in some 

embodiments, RNA polymerase III, to initiate RNA synthesis at the appropriate 
transcription initiation site for a particular coding sequence. An expression control 
element may additionally comprise other recognition sequences generally positioned 
upstream or 5' to the TATA box, which influence {e.g., enhance) the transcription 

15 initiation rate. Furthermore, an expression control element may additionally comprise 

sequences generally positioned downstream or 3' to the TATA box, which influence 
{e.g., enhance) the transcription initiation rate. 

The transcriptional initiation region (e.g., a promoter) may be native or 
homologous or foreign or heterologous to the host, or could be the natural sequence or 

20 a synthetic sequence. By foreign, it is intended that the transcriptional initiation 

region is not found in the wild-type host into which the transcriptional initiation 
region is introduced. By "functional promoter" is intended the promoter, when 
operably linked to a sequence encoding a protein of interest, is capable of driving 
expression (i.e., transcription and translation) of the encoded protein, or, when 

25 operably linked to an inhibitory sequence encoding an inhibitory nucleotide molecule 

(for example, a hairpin RNA, double-stranded RNA, miRNA polynucleotide, and the 
like), the promoter is capable of initiating transcription of the operably linked 
inhibitory sequence such that the inhibitory nucleotide molecule is expressed. The 
promoters can be selected based on the desired outcome. Thus the expression 

30 cassettes of the invention can comprise constitutive, tissue-preferred, or other 

promoters for expression in plants. 

As used herein a chimeric gene comprises a coding sequence operably linked 
to a transcription initiation region that is heterologous to the coding sequence. 
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Any suitable promoter known in the art can be employed according to the 
present invention, including bacterial, yeast, fungal, insect, mammalian, and plant 
promoters. For example, plant promoters, including duckweed promoters, may be 
used. Exemplary promoters include, but are not limited to, the Cauliflower Mosaic 
5 Virus 35S promoter, the opine synthetase promoters (e.g., nos, mas, ocs, etc.), the 

ubiquitin promoter, the actin promoter, the ribulose bisphosphate (RubP) carboxylase 
small subunit promoter, and the alcohol dehydrogenase promoter. The duckweed 
RubP carboxylase small subunit promoter is known in the art (Silverthome et al 
(1990) Plant MoL Biol 15:49). Other promoters from viruses that infect plants, 

10 preferably duckweed, are also suitable including, but not limited to, promoters 

isolated from Dasheen mosaic virus, Chlorella virus (e.g., the Chlorella virus adenine 
methyltransferase promoter; Mitra et al (1994) Plant Mol Biol 26:85), tomato 
spotted wilt virus, tobacco rattle virus, tobacco necrosis virus, tobacco ring spot virus, 
tomato ring spot virus, cucumber mosaic virus, peanut stump virus, alfalfa mosaic 

15 virus, sugarcane baciliform badnavirus and the like. 

Other suitable expression control elements are disclosed in the commonly 
owned and copending provisional application entitled ''Expression Control Elements 
from the Lemnaceae Family^'' assigned U.S. Patent Application No. 60/759,308, 
Attorney Docket No. 040989/243656, filed January 17, 2006, herein incorporated by 

20 reference in its entirety. The expression control elements disclosed in this copending 

application were isolated from ubiquitin genes for several members of the Lemnaceae 
family, and are thus referred to as ''Lemnaceae ubiquitin expression control 
elements." SEQ ID NO: 7 of the present application sets forth the full-length Lemna 
minor ubiquitin expression control element, including both the promoter plus 5' UTR 

25 (nucleotides 1-1625) and intron (nucleotides 1626-2160). SEQ ID N0:8 sets forth the 

full-length Spirodella polyrrhiza ubiquitin expression control element, including both 
the promoter plus 5' UTR (nucleotides 1-1041) and intron (nucleotides 1042-2021). 
SEQ ID N0:9 sets forth the full-length Lemna aequinoctialis ubiquitin expression 
control element, including both the promoter plus 5' UTR (nucleotides 1-964) and 

30 intron (nucleotides 965-2068). SEQ ID NO: 10 sets forth the promoter plus 5* UTR 

portion of the L. minor ubiquitin expression control element (designated "LmUbq 
promoter" herein). SEQ ID NO: 1 1 sets forth the promoter plus 5' UTR portion of the 
S. polyrrhiza ubiquitin expression control element (designated "SpUbq promoter" 
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herein). SEQ ID NO: 1 2 sets forth the promoter plus 5' UTR portion of the L 
aequinoctialis ubiquitin expression control element (designated "LaUbq promoter" 
herein). SEQ ID NO: 13 sets forth the intron portion of the L minor ubiquitin 
expression control element (designated "LmUbq intron" herein). SEQ ID NO: 1 4 sets 
5 forth the intron portion of the S. polyrrhiza ubiquitin expression control element 

(designated "SpUbq intron" herein). SEQ ID NO: 15 sets forth the intron portion of 
the L aequinoctialis ubiquitin expression control element (designated "LaUbq intron' 
herein). It is recognized that the individual promoter plus 5' UTR sequences set forth 
in SEQ ID NOs: 10-12, and biologically active variants and fragments thereof, can be 

10 used to regulate transcription of operably linked nucleotide sequences of interest in 

plants. Similarly, one or more of the intron sequences set forth in SEQ ID NOs: IS- 
IS, and biologically active fragments or variants thereof, can be operably linked to a 
promoter of interest, including a promoter set forth in SEQ ID NO: 10, 1 1, or 12 in 
order to enhance expression of a nucleotide sequence that is operably linked to that 

15 promoter. 

Fragments and variants of the disclosed expression control elements can also 
be used within expression cassettes to drive expression of the operably linked 
polynucleotide of interest By "fragment of an expression control element" is 
intended a portion of the full-length expression control element, such as a portion of 

20 any one of the expression control elements set forth in SEQ ID NOs: 7-9. Fragments 

of an expression control element retain biological activity and hence encompass 
fragments capable of initiating or enhancing expression of an operably linked 
polynucleotide of interest Thus, for example, less than the entire expression control 
elements disclosed herein may be utilized to drive expression of an operably linked 

25 polynucleotide of interest Specific, non-limiting examples of such fragments of an 

expression control element inclijde the nucleotide sequences set forth in any one of 
SEQ ID NOs: 10- 12 (as described herein above), as well as 5' truncations of the L 
minor ubiquitin expression control element (SEQ ID NO: 7), such as nucleotides 
1288-2160 of SEQ ID NO: 7 (LmUbq truncated promoter No. 1) and nucleotides 

30 1132-21 60 of SEQ ID NO: 1 (LmUbq truncated promoter No. 2). See the copending 

provisional application assigned U.S. Patent Application No. 60/759,308, herein 
incorporated by reference in its entirety. 
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The nucleotides of such fragments will usually comprise the TATA 
recognition sequence of the particular expression control element. Such fragments 
can be obtained by use of restriction enzymes to cleave the naturally occurring 
expression control elements disclosed herein; by synthesizing a nucleotide sequence 
5 from the naturally occurring sequence of the expression control element DNA 

sequence; or can be obtained through the use of polymerase chain reaction (PCR) 
technology. See particularly, MuUis e/ a/. Methods Enzymol 155:335-350, 

and Eriich, ed. (1989) PCR Technology (Stockton Press, New York). 

Variants of expression control elements, such as those resuhing from site- 

10 directed mutagenesis, can also be used in the expression cassettes of the present 

invention to provide expression of the operably linked polynucleotide of interest. By 
"variant of an expression control element" is intended sequences having substantial 
similarity with an expression control element disclosed herein (for example, the 
expression control element set forth in SEQ ID N0:7, 9, or 9), or with a fragment 

15 thereof (for example, the respective sequences set forth in SEQ ID NOs: 10-1 5). 

Naturally occurring variants of expression control elements can be identified with the 
use of well-known molecular biology techniques, as, for example, with PCR and 
hybridization techniques as outlined above. Variant expression control elements also 
include synthetically derived nucleotide sequences, such as those generated, for 

20 example, by using site-directed mutagenesis. Generally, variants of a particular 

expression control element disclosed herein, including variants of SEQ ID NOs:7-15, 
will have at least 40%, 50%, 60%, 65%, 70%, generally at least 75%, 80%, 85%, 
preferably about 90%, 91%, 92%, 93%, 94%. to 95%, 96%, 97%, and more 
preferably about 98%, 99% or more sequence identity to that particular nucleotide 

25 sequence as determined by sequence alignment programs described herein above 

using default parameters. 

Expression control elements, including promoters, can be chosen to give a 
desired level of regulation. For example, in some instances, it may be advantageous 
to use a promoter that confers constitutive expression (e.g, the mannopine synthase 

30 promoter from Agrobacterium tumefaciens). Alternatively, in other situations, for 

example, where expression of a heterologous polypeptide is concemed, it may be 
advantageous to use promoters that are activated in response to specific 
environmental stimuh (e.g., heat shock gene promoters, drought-inducible gene 
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promoters, pathogen-inducible gene promoters, wound-inducible gene promoters, and 
light/dark-inducible gene promoters) or plant growth regulators (e.g., promoters from 
genes induced by abscissic acid, auxins, cytokinins, and gibberellic acid). As a 
further alternative, promoters can be chosen that give tissue-specific expression (e.g., 
5 root, leaf, and floral-specific promoters). 

The overall strength of a given promoter can be influenced by the combination 
and spatial organization of cis-acting nucleotide sequences such as upstream 
activating sequences. For example, activating nucleotide sequences derived from the 
Agrobacterium tumefaciens octopine synthase gene can enhance transcription from 

10 the Agrobacterium tumefaciens mannopine synthase promoter (see U.S. Patent 

5,955,646 to Gelvin et al). In the present invention, the expression cassette can 
contain activating nucleotide sequences inserted upstream of the promoter sequence to 
enhance the expression of the nucleotide sequence of interest. In one embodiment, 
the expression cassette includes three upstream activating sequences derived from the 

1 5 Agrobacterium tumefaciens octopine synthase gene operably linked to a promoter 

derived from an Agrobacterium tumefaciens mannopine synthase gene (see U.S 
Patent 5,955,646, herein incorporated by reference). 

Where the expression control element will be used to drive expression of an 
operably linked DNA sequence encoding a small hpRNA molecule, for example, 

20 within an RNAi expression cassette described herein above, it is advantageous to use 

an expression control element comprising a promoter recognized by the DNA 
dependent RNA polymerase 111. As used herein, "a promoter recognized by the DNA 
dependent RNA polymerase III" is a promoter which directs transcription of the 
associated DNA region through the polymerase action of RNA polymerase HI. These 

25 include genes encoding 5S RNA, tRNA, 7SL RNA, U6 snRNA and a few other small 

stable RNAs, many involved in RNA processing. Most of the promoters used by Pol 
III require sequence elements downstream of +1, within the transcribed region. A 
minority of pol HI templates however, lack any requirement for intragenic promoter 
elements. These are referred to as type 3 promoters. By "type 3 Pol HI promoters" is 

30 intended those promoters that are recognized by RNA polymerase III and contain all 

cis-acting elements, interacting with the RNA polymerase 111 upstream of the region 
normally transcribed by RNA polymerase III. Such type 3 Pol HI promoters can be 
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assembled within the RNAi expression cassettes of the invention to drive expression 
of the operably linked DNA sequence encoding the small hpRNA molecule. 

Typically, type 3 Pol m promoters contain a TATA box (located between -25 
and -30 in Human U6 snRNA gene) and a Proximal Sequence element (PSE; located 
5 between -47 and -66 in Human U6 snRNA). They may also contain a Distal Sequence 

Element (DSE; located between -214 and -244 in Human U6 snRNA). Type 3 Pol HI 
promoters can be found, e.g., associated with the genes encoding 7SL RNA, U3 
snRNA and U6 snRNA. Such sequences have been isolated from Arabidopsis, rice, 
and tomato. See, for example, SEQ ID N0s:l-8 of U.S. Patent Application 

1 0 Publication No. 2004023 1016. 

Other nucleotide sequences for type 3 Pol in promoters can be found in 
nucleotide sequence databases under the entries for the A. thaliana gene AT7SL-1 for 
7SL RNA (X72228), A. thaliana gene AT7SL-2 for 7SL RNA (X72229), A, thaliana 
gene AT7SL-3 for 7SL RNA (AJ290403), Humulus lupulus H17SL-1 gene 

15 (AJ236706), Humulus lupulus H17SL-2 gene (AJ236704), Humulus lupulus H17SL-3 

gene (AJ236705), Humulus lupulus H17SL-4 gene (AJ236703), A, thaliana U6-1 
snRNA gene (X52527), A. thaliana U6-26 snRNA gene (X52528), A, thaliana U6-29 
snRNA gene (X52529), A, thaliana U6-1 snRNA gene (X52527). Zea mays U3 
snRNA gene (Z29641), Solanum tuberosum U6 snRNA gene (Z17301; X 60506; 

20 S83742), tomato U6 smal nuclear RNA gene (X51447), A, thaliana U3C snRNA gene 

(X52630). A, thaliana U3B snRNA gene (X52629), Oryza sativa U3 snRNA 
promoter (X79685), tomato U3 small nuclear RNA gene (X1441 1), Triticum aestivum 
U3 snRNA gene (X63065), and Triticum aestivum U6 snRNA gene (X63066). 

Other type 3 Pol HI promoters may be isolated from other varieties of tomato, 

25 rice or Arabidopsis, or from other plant species using methods well known in the art. 

For example, libraries of genomic clones from such plants may be isolated using U6 
snRNA, U3 snRNA, or 7SL RNA coding sequences (such as the coding sequences of 
any of the above mentioned sequences identified by their accession number and 
additionally the Vicia faba U6snRNA coding sequence (X04788), the maize DNA for 

30 U6 snRNA (X523 1 5), or the maize DNA for 7SL RNA (XI 4661)) as a probe, and the 

upstream sequences, preferably the about 300 to 400 bp upstream of the transcribed 
regions may be isolated and used as type 3 Pol HI promoters. Alternatively, PGR 
based techniques such as inverse-PCR or TAIL^^-PCR may be used to isolate the 
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genomic sequences including the promoter sequences adjacent to known transcribed 
regions. Moreover, any of the type 3 Pol in promoter sequences described herein, 
identified by their accession numbers and SEQ ID NOS, may be used as probes under 
stringent hybridization conditions or as source of information to generate PGR 
primers to isolate the corresponding promoter sequences from other varieties or plant 
species. 

Although type 3 Pol EI promoters have no requirement for cis-acting elements 
located with the transcribed region, it is clear that sequences normally located 
downstream of the transcription initiation site may nevertheless be included in the 
RNAi expression cassettes of the invention. Further, while type 3 Pol III promoters 
originally isolated from monocotyledonous plants can effectively be xised in RNAi 
expression cassettes to suppress expression of a target gene in both dicotyledonous 
and monocotyledonous plant cells and plants, type 3 Pol HI promoters originally 
isolated from dicotyledonous plants reportedly can only be efficiently used in 
dicotyledonous plant cells and plants. Moreover, the most efficient gene silencing 
reportedly is obtained when the RNAi expression cassette is designed to comprise a 
type 3 Pol III promoter derived from the same or closely related species. See, for 
example. U.S. Patent Application Publication No. 2004023 1016. Thus, where the 
plant of interest is a monocotyledonous plant, and small hpRNA interference is the 
method of choice for inhibiting expression of FucT and/or XylT, the type 3 Pol in 
promoter preferably is from another monocotyledonous plant, including the plant 
species for which the glycosylation pattern of A^-linked glycans of a glycoprotein of 
interest is to be altered. 

The expression cassette of the invention thus includes in the 5'-3* direction of 
transcription, an expression control element comprising a transcriptional and 
translational initiation region, a polynucleotide of interest, for example, a sequence 
encoding a heterologous protein of interest or a sequence encoding a FucT or XylT 
inhibitory sequence that, when expressed, is capable of inhibiting the expression or 
function of FucT and/or XylT, and a transcriptional and translational termination 
region functional in plants. Any suitable termination sequence known in the art may 
be used in accordance with the present invention. The termination region may be 
native with the transcriptional initiation region, may be native with the nucleotide 
sequence of interest, or may be derived from another source. Convenient termination 
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regions are available from the Ti-plasmid of A. tumefaciens, such as the octopine 
synthetase and nopaline synthetase termination regions. See also Guerineau et al 
(1991)Mo/. Gen, Genet. 262:141; Proudfoot (1991) Ce// 64:671; Sanfacon a/. 
(1991) Genes Dev. 5:141; Mogen et al (1990) Plant Cell 2:1261; Munroe et al 
5 (1990)G^/2e91:151;Ballase/a/. (im) Nucleic Acids Res. 17:7891; and Joshi a/. 

(1987) Nucleic Acids Res, 15:9627. Additional exemplary termination sequences are 
the pea Rub? carboxylase small subunit termination sequence and the Cauliflower 
Mosaic Virus 35S termination sequence. Other suitable termination sequences will be 
apparent to those skilled in the art, including the oligo dT stretch disclosed herein 

10 above for use with type 3 Pol 111 promoters driving expression of a FucT and/or XlyT 

inhibitoiy polynucleotide that forms a small hpRNA structure. 

Alternatively, the polynucleotide(s) of interest can be provided on any other 
suitable expression cassette known in the art. 

Generally, the expression cassette will comprise a selectable marker gene for 

1 5 the selection of transformed cells or tissues. Selectable marker genes include genes 

encoding antibiotic resistance, such as those encoding neomycin phosphotransferase 
TI (NEO) and hygromycin phosphotransferase (HPT), as well as genes conferring 
resistance to herbicidal compounds. Herbicide resistance genes generally code for a 
modified target protein insensitive to the herbicide or for an enzyme that degrades or 

20 detoxifies the herbicide in the plant before it can act. See DeBlock et al (1987) 

EMBO J, 6:25 1 3 ; DeBlock et al ( 1 989) Plant Physiol 9 1 :69 1 ; Fromm et al ( 1 990) 
BioTechnology 8:833; Gordon-Kamm et al (1990) Plant Cell 2:603. For example, 
resistance to gl3^hosphate or sulfonylurea herbicides has been obtained using genes 
coding for the mutant target enzymes, 5-enolpyruvylshikimate-3 -phosphate synthase 

25 (EPSPS) and acetolactate synthase (ALS). Resistance to glufosinate ammonium, 

boromoxynil, and 2,4-dichIorophenoxyacetate (2,4-D) have been obtained by using 
bacterial genes encoding phosphinothricin acetyltransfcrase, a nitrilase, or a 2,4- 
dichlorophenoxyacetate monooxygenase, which detoxify the respective herbicides. 

For purposes of the present invention, selectable marker genes include, but are 

30 not limited to, genes encoding neomycin phosphotransferase TI (Fraley et al (1 986) 

CRC Critical Reviews in Plant Science 4:1); cyanamide hydratase (Maier-Greiner 
al (1991) Ptoc. Natl Acad, ScL USA 88:4250); aspartate kinase; dihydrodipicolinate 
synthase (Peri et al (1993) BioTechnology 1 1:715); bar gene (Toki et al (1992) Plant 
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Physiol 100:1503; Meagher al (1996) Crop ScL 36:1367); tryptophan 
decarboxylase (Goddijn et al (1993) Plant Mol Biol. 22:907); neomycin 
phosphotransferase (NEO; Southern et al (1982) J. Mol Appl Gen. 1:327); 
hygromycin phosphotransferase (HPT or HYG; Shimizu et al (1986) Mol Cell Biol 
5 6: 1074); dihydrofolate reductase (DHFR; Kwok et al (1986) Proc. Natl Acad. Scl 

USA 83:4552); phosphinothricin acetyltransferase PeBlock et al (1987) EMBOJ, 
6:25 13); 2,2-dichloropropionic acid dehalogenase (Buchanan- WoUatron et al (1989) 
y. Cell Biochem. 13D:330); acetohydroxyacid synthase (U.S. Pat. No. 4,761,373 to 
Anderson et al; Haughn et al (1988) Mol Gen. Genet. 221:266); 5-enolpyruvyl- 

10 shikimate-phosphate synthase (aroA; Comai et al (1985) Nature 3 17:741); 

haloarylnitrilase (WO 87/04181 to Stalker et al); acetyl-coenzyme A carboxylase 
(Parker et al (1990) Plant Physiol 92: 1220); dihydropteroate synthase (sull; 
Guerineau et al (1 990) Plant Mol Biol 1 5: 127); and 32 kDa photosystem II 
polypeptide (psbA; Hirschberge/ al (1983) Science 222:1346 (1983), 

15 Also included are genes encoding resistance to: gentamycin (e.g., aacCl, 

Wohlleben et al (1989) Mol Gen. Genet. 217:202-208); chloramphenicol (Herrera- 
Estrella et al (1983) EMBOJ. 2:987); methotrexate (Herrera-Estrella et al (1983) 
Nature 303:209; Meijer e/a/. (1991) Plant Mol Biol 16:807); hygromycin (Waldron 
et al (mS) Plant Mol Biol 5:103; Zhijian etal (1995) Plant Science 108:219; 

20 Meijer et al (1991) Plant Mol Bio. 1 6:807); streptomycin (Jones et al (1 987) Mol 

Gen. Genet. 210:86); spectinomycin (Bretagne-Sagnard a/. (1996) Transgenic Res. 
5:131); bleomycin (Hille etal {\9U) Plant Mol Biol 7:171); sulfonamide 
(Guerineau et al (1990) Plant Mol Bio. 15:127); bromoxynil (Stalker et al (1988) 
Science 242:419); 2,4-D (Streber et al (1989) BioTechnology 7:81 1); 

25 phosphinothricin (DeBlock et al (1 987) EMBO J. 6:25 1 3); spectinomycin (Bretagne- 

Sagnard and Chupeau, Transgenic Research 5:131). 

The bar gene confers herbicide resistance to glufosinate-type herbicides, such 
as phosphinothricin (PPT) or bialaphos, and the like. As noted above, other selectable 
markers that could be used in the vector constructs include, but are not limited to, the 

30 pat gene, also for bialaphos and phosphinothricin resistance, the ALS gene for 

imidazolinone resistance, the HPH or HYG gene for hygromycin resistance, the EPSP 
synthase gene for glyphosate resistance, the Hml gene for resistance to the Hc-toxin, 
and other selective agents used routinely and known to one of ordinary skill in the art. 
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See Yarranton (1992) Cum Opin, Biotech. 3:506; Chistopherson et aL (1992) Proa 
Natl. Acad. Sci. USA 89:6314; Yao et al. (1992) Ceini:63; Reznikoff (1992) MoL 
Microbiol 6:2419; Barkley et al (1980) The Operon 177-220; Hu et al (1987) Cell 
48:555; Brown et al (1987) Cell 49:603; Figge et al (1 988) Cell 52:71 3; Deuschle et 
5 al (1989) Proc, Natl Acad. Scl USA 86:5400; Fuerst et al (1989) Proa Natl Acad 

ScL USA 86:2549; Deuschle et al (1990) Science 248:480; Labow et al (1990) Mol 
Cell Biol 10:3343; Zambretti et al (1992) Proc. Natl Acad. Scl USA 89:3952; Bairn 
et a/. (1991) Proc. Natl Acad. ScL USA 88:5072; Wyborski et a/. (1991) Nuc. Acids 
Res. 19:4647; Hillenand-Wissman (1989) Topics in Mol AndStruc. Biol. 10: 143; 

10 Degenkolb et al (\99 1) Antimicrob. Agents Chemother. 35:1591; Kleinschnidt et al. 

(1988) Biochemistry 27: 1094; Gatz et al (1992) Plant J. 2:397; Gossen et al. (1992) 
Proc. Natl Acad. Scl USA 89:5547; OVw^etal (1992) Antimicrob. Agents 
Chemother. 36:913; Hlavka et al (1985) Handbook of Experimental Pharmacology 
78; and Gill et al (1988) Nature 334:721. Such disclosures are herein incorporated by 

15 reference. 

The above list of selectable marker genes are not meant to be limiting. Any 
selectable marker gene can be used in the present invention. 

Modification of Nucleotide Sequences for Enhanced Expression in a Plant Host 
20 Where the plant of interest is also genetically modified to express a 

heterologous protein of interest, for example, a transgenic plant host serving as an 
expression system for recombinant production of a heterologous protein, the present 
invention provides for the modification of the expressed polynucleotide sequence 
encoding the heterologous protein of interest to enhance its expression in the host 
25 plant. Thus, where appropriate, the polynucleotides may be optimized for increased 

expression in the transformed plant. That is, the polynucleotides can be synthesized 
using plant-preferred codons for improved expression. See, for example, Campbell 
and Gowri (1990) Plant Physiol 92:1-1 1 for a discussion of host-preferred codon 
usage. Methods are available in the art for synthesizing nucleotide sequences with 
30 plant-preferred codons. See, e.g., U.S. Patent Nos. 5,380,83 1 and 5,436,391 ; Perlak et 

al (1991) Proc. Natl Acad. Scl USA 15:3324; lannacome et al (1997) Plant Mol 
Biol 34:485; and Murray et al, (1989) Nucleic Acids. Res. XlAll, herein 
incorporated by reference. 
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In some embodiments of the invention, the plant host is a member of the 
duckweed family, and the polynucleotide encoding the heterologous polypeptide of 
interest, for example, a mammalian polypeptide, is modified for enhanced expression 
of the encoded heterologous polypeptide. In this manner, one such modification is the 
5 synthesis of the polynucleotide encoding the heterologous polypeptide of interest 

using duckweed-preferred codons, where synthesis can be accomplished using any 
method known to one of skill in the art. The preferred codons may be determined 
from the codons of highest frequency in the proteins expressed in duckweed. For 
example, the frequency of codon usage for Lemna gibba is found on the web page: 

10 http://www.kazusa.or jp/codon/cgi-bin/showcodon.cgi?species= 

Lemna+gibba+[gbpln], and the frequency of codon usage for Lemna minor is found 
on the web page http://www.kazusa.or.jp/codon/cgibin/showcodon.cgi?species= 
Lemna+minon-[gbpln] and in Table 1. It is recognized that heterologous genes that 
have been optimized for expression in duckweed and other monocots, as well as other 

15 dicots, can be used in the methods of the invention. See, e.g., EP 0 359 472, EP 0 385 

962, WO 91/16432; Perlak et al (1991) Proc, Natl. Acad. Scl USA 88:3324; 
lannacome et al. (1997) Plant Mol. Biol. 34:485; and Murray et al (1989) Nuc. Acids 
Res. 17:477, and the like, herein incorporated by reference. It is further recognized 
that all or any part of the polynucleotide encoding the heterologous polypeptide of 

20 interest may be optimized or synthetic. In other words, fully optimized or partially 

optimized sequences may also be used. For example, 40 %, 45%, 50%, 55%, 60%, 
65%, 70%, 75%, 80%, 85%, 87%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 
98%, 99%, or 100% of the codons may be duckweed-preferred codons. In one 
embodiment, between 90 and 96 % of the codons are duckweed-preferred codons. 

25 The coding sequence of a polynucleotide sequence encoding a heterologous 

polypeptide of interest may comprise codons used with a frequency of at least 17% in 
Lemna gibba. In one embodiment, the modified nucleotide sequence is the human a- 
2B-interferon encoding nucleotide sequence shown in SEQ ID NO: 16, which contains 
93% duckweed preferred codons. 

30 

Table 1: Lemna gibba-prekrrti codons from GenBank Release 113 

UUU 2.2 ( 4) UCU 0.5 ( I) UAU 2.2 ( 4) UGU 0.0 ( 0) 
UUC 50.5 ( 92) UCC31.9 ( 58) UAC40.1 ( 73) UGCI7.6 ( 32) 
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10 



15 



UUA 0,0 


( 0) 


UCA 0.5 ( 


1) 


UAA 3.8 ( 


: 7) 


UGA 1.6 ( 


3) 


UUG 2.7 


( 5) 


UCG 15.4 ( 


28) 


UAG 0.0 { 


: 0) 


UGG24.2 ( 


44) 


L.UU V.J 




ecu 6.6 ( 


12) 


CAU 0.5 1 


. 1) 


CGU 1.1 ( 


2) 


cue 39.0 


( 71) 


CCC43.4 ( 


79) 


CAC 6.6 ( 


: 12) 


CGC26.9 ( 


49) 


CUA l.l 


( 2) 


CCA 2.2 ( 


4) 


CAA 4.4 { 


: 8) 


CGA l.l ( 


2) 


CUG 22.5 


( 41) 


CCG20.9 ( 


38) 


CAG26.9 { 


: 49) 


CGG 7.7 ( 


14) 


AUU 0.0 


( 0) 


ACU 3.3 ( 


6) 


AAU 1.1 ( 


: 2) 


AGU 0.0 ( 


0) 


AUC 33.5 


( 61) 


ACC26.4 ( 


48) 


AAC37.9 < 


: 69) 


AGC22.0 ( 


40) 


AUA 0.0 


( 0) 


ACA 0.5 ( 


1) 


AAA 0.0 ( 


: 0) 


AGA 4.9 ( 


9) 


AUG 33.5 


( 61) 


ACG 9.3 ( 


17) 


AAG57.1 ( 


; 104) 


AGG 6.0 ( 


11) 


GUU 9.3 


( 17) 


GCU 7.1 ( 


13) 


GAU 1.6 ( 


: 3) 


GGU 1.1 ( 


2) 


GUC 28.0 


( 51) 


GCC73.6 ( 


134) 


GAC38.4 ( 


: 70) 


GGC46.7 ( 


85) 


GUA 0.0 


( 0) 


GCA 5.5 ( 


10) 


GAA 2.2 ( 


: 4) 


GGA 1.1 ( 


2) 


GUG 34.0 


( 62) 


GCG20.9 ( 


38) 


GAG 62.6 { 


; 114) 


GGG27.5 ( 


50) 
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Other modifications can also be made to the polynucleotide encoding the 
heterologous polypeptide of interest to enhance its expression in a plant host of 
interest, including duckweed. These modifications include, but are not limited to, " 
elimination of sequences encoding spurious polyadenylation signals, exon-intron 
splice site signals, transposon-like repeats, and other such well characterized 
sequences which may be deleterious to gene expression. The G-C content of the 
sequence may be adjusted to levels average for a given cellular host, as calculated by 
reference to known genes expressed in the host cell. When possible, the 
polynucleotide encoding the heterologous polypeptide of interest may be modified to 
avoid predicted hairpin secondary m£^A structures. 

There are known differences between the optimal translation initiation context 
nucleotide sequences for translation initiation codons in animals and plants and the 
composition of these translation initiation context nucleotide sequences can influence 
the efficiency of translation initiation. See, for example, Lukaszewicz et al. (2000) 
Plant Science 154:89-98; and Joshi et al (1997); Plant MoL Biol 35:993-1001. In 
the present invention, the translation initiation context nucleotide sequence for the 
translation initiation codon of the polynucleotide nucleotide of interest, for example, 
the polynucleotide encoding a heterologous polypeptide of interest, may be modified 
to enhance expression in duckweed. In one embodiment, the nucleotide sequence is 
modified such that the three nucleotides directly upstream of the translation initiation 
codon of the nucleotide sequence of interest are " ACC." In a second embodiment, 
these nucleotides are "ACA." 
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Expression of a transgene in a host plant, including duckweed, can also be 
enhanced by the use of 5' leader sequences. Such leader sequences can act to enhance 
translation. Translation leaders are known in the art and include, but are not limited 
to, picomavirus leaders, e.g., EMCV leader (Encephalomyocarditis 5' noncoding 
5 region; Elroy-Stein et al (1989) Proc, Natl Acad. Sci USA 86:6126); potyvirus 

leaders, e.g., TEV leader (Tobacco Etch Virus; Allison et al (1986) Virology 154:9); 
human immunoglobulin heavy-chain binding protein (BiP; Macajak and Samow 
(1991) Nature 353:90); untranslated leader from the coat protein mRNA of alfalfa 
mosaic virus (AMV RNA 4; Jobling and Gehrke (1987) Nature 325:622); tobacco 

10 mosaic virus leader (TMV; Gallic (1989) Molecular Biology of RNA, 23:56); potato 

etch virus leader (Tomashevskaya et al (1993) J. Gen. Virol 74:2717-2724); Fed-1 5* 
untranslated region (Dickey (1 992) EMBO 7. 1 1 :23 1 1 -23 1 7); RbcS 5' untranslated 
region (Silverthome et al (1990)7. Plant, Mol Biol 15:49-58); and maize chlorotic 
mottle virus leader (MCMV; Lommel el a/. (1991) Virology 81 :382). See also, Della- 

15 Cioppa et al (1987) Plant Physiology 84:965. Leader sequence comprising plant 

intron sequence, including intron sequence from the maize alcohol dehydrogenase 1 
(ADHl) gene, the castor bean catalase gene, or the Arabidopsis tryptophan pathway 
gene PATl has also been shown to increase translational efficiency in plants (Callis et 
al (1987) Genes Dev. 1:1 183-1200; Mascarenhas a/. (1990) P/a«/Afo/. i5/o/. 

20 15:913-920). See also copending provisional application U.S. Patent Application No. 

60/759,308, wherein leader sequence comprising a duckweed intron sequence 
selected from the group consisting of the introns set forth in SEQ ID NOs: 13-15 
provides for increased translational efficiency in duckweed. 

In some embodiments of the present invention, nucleotide sequence 

25 corresponding to nucleotides 1222-1775 of the maize alcohol dehydrogenase 1 gene 

(ADHl; GenBank Accession Number X04049), or nucleotide sequence 
corresponding to the intron set forth in SEQ ID N0:13, 14, or 15, is inserted upstream 
of the polynucleotide encoding the heterologous polypeptide of interest or the FucT 
and/or XylT inhibitory polynucleotide to enhance the efficiency of its translation. In 

30 another embodiment, the expression cassette contains the leader from the Lemna 

gibba ribulose-bis-phosphate carboxylase small subunit 5B gene (RbcS leader; see 
Buzby et al (1990) Plant Cell 2:805-814; also see SEQ ID N0:16, 17, or 18 of the 
present invention). 
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See also, by way of example only, the expression vectors disclosed in the 
figures herein, wherein the RbcS leader and ADHl intron are included as upstream 
regulatory sequences within an expression cassette comprising the FucT inhibitory 
polynucleotide (Figure 8), the XylT inhibitory polynucleotide (Figures 9 and 1 1), an 
5 expression cassette comprising the chimeric FucT/XylT inhibitory molecule (Figure 

10), or an expression cassette comprising the coding sequence for the heterologous 
polypeptide, the IgGl heavy chain of a monoclonal antibody (Figures 12, 13, and 14) 
or the light chain of a monoclonal antibody (Figure 14); wherein the LmUbq promoter 
and LmUbq intron are included as upstream regulatory sequences within an 

10 expression cassette comprising the FucT inhibitory polynucleotide (Figure 1 1), or an 

expression cassette comprising the coding sequence for the heterologous polypeptide, 
the IgFl light chain of a monoclonal antibody (Figure 13); wherein the SpUbq 
promoter and SpUbq intron are included as upstream regulatory sequences within an 
expression cassette comprising the FucT inhibitory polynucleotide (Figure 13), or an 

1 5 expression cassette comprising the chimeric FucT/XylT inhibitory polynucleotide 

(Figure 12); and wherein the LaUbq promoter and LaUbq intron are included as 
upstream regulatory sequences in an expression cassette comprising the XylT 
inhibitory polynucleotide (Figure 13). 

It is recognized that any of the expression-enhancing nucleotide sequence 

20 modifications described above can be used in the present invention, including any 

single modification or any possible combination of modifications. The phrase 
''modified for enhanced expression" in a plant, for example, a duckweed plant, as 
used herein refers to a polynucleotide sequence that contains any one or any 
combination of these modifications. 

25 

Signal Peptides 

It is recognized that the heterologous polypeptide of interest may be one that is 
normally or advantageously expressed as a secreted protein. Secreted proteins are 
usually translated from precursor polypeptides that include a "signal peptide" that 
30 interacts with a receptor protein on the membrane of the endoplasmic reticulum (ER) 

to direct the translocation of the growing polypeptide chain across the membrane and 
into the endoplasmic reticulum for secretion from the cell. This signal peptide is 
often cleaved from the precursor polypeptide to produce a "mature" polypeptide 
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lacking the signal peptide. In an embodiment of the present invention, a biologically 
active polypeptide is expressed in the plant host of interest, for example, duckweed or 
other higher plant, from a polynucleotide sequence that is operably linked with a 
nucleotide sequence encoding a signal peptide that directs secretion of the polypeptide 
5 into the culture medium. Plant signal peptides that target protein translocation to the 

endoplasmic reticulum (for secretion outside of the cell) are known in the art. See, for 
example, U.S. Patent No. 6,020,169 to Lee et al In the present invention, any plant 
signal peptide can be used to target the expressed polypeptide to the ER. 

In some embodiments, the signal peptide is the Arabidopsis thaliana basic 

10 endochitinase signal peptide (amino acids 14-34 of NCBl Protein Accession No. 

BAA82823), the extensin signal peptide (Stiefel et al (1990) Plant Cell 2:785-793), 
the rice a-amylase signal peptide (amino acids 1-31 of NCBI Protein Accession No. 
AAA33885), or a modified rice a-amylase signal sequence (SEQ ID NO: 17). In 
another embodiment, the signal peptide corresponds to the signal peptide of a secreted 

1 5 duckweed protein. 

Alternatively, a mammalian signal peptide can be used to target recombinant 
polypeptides expressed in a genetically engineered plant of the invention, for 
example, duckweed or other higher plant of interest, for secretion. It has been 
demonstrated that plant cells recognize mammalian signal peptides that target the 

20 endoplasmic reticulum, and that these signal peptides can direct the secretion of 

polypeptides not only through the plasma membrane but also through the plant cell 
wall. See U.S. Patent Nos. 5,202,422 and 5,639,947 to Hiatt et al In one 
embodiment of the present invention, the mammalian signal peptide that targets 
polypeptide secretion is the human a-2b-interferon signal peptide (amino acids 1-23 

25 of NCBI Protein Accession No. AAB59402). 

In one embodiment, the nucleotide sequence encoding the signal peptide is 
modified for enhanced expression in the plant host of interest, for example, duckweed 
or other higher plant, utilizing any modification or combination of modifications 
disclosed above for the polynucleotide sequence of interest. 

30 The secreted biologically active polypeptide can be harvested from the culture 

medium by any conventional means known in the art and purified by 
chromatography, electrophoresis, dialysis, solvent-solvent extraction, and the like. In 
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this manner, purified polypeptides, as defined above, can be obtained fi*om the culture 
medium. 

Thus, in some embodiments, the protein expression host system is a plant, for 
example, a duckweed or other higher plant, and the secreted biologically active 

5 polypeptide is a glycoprotein of the invention, where the glycoprotein has a 

substantially homogeneous glycosylation profile, and is substantially homogeneous 
for the GO glycoform. In such embodiments, any such glycoprotein that may remain 
within the plant material can optionally be isolated and purified as described above. 
The secreted glycoprotein can be obtained from the plant culture medium and purified 

10 using any conventional means in the art as noted above. In this manner, the purified 

glycoprotein obtained from the plant material is substantially free of plant cellular 
material, and includes embodiments where the preparations of glycoprotein have less 
than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating plant 
protein. Where the purified glycoprotein is obtained from the plant culture medium, 

15 the plant culture medium represents less than about 30%, 20%. 10%, 5%, or 1% (by 

dry weight) of chemical precursors or non-protein-of-interest chemicals within the 
purified glycoprotein preparation. 

In some embodiments, these purified glycoproteins obtained from the plant 
host can include at least 0.001%, 0.005%, 0. 1%, 0.5%, 1%, 1.5%. 2%, 2.5%, 3%, 

20 3.5%, 4%. 4.5%, 5%, 5.5%, 6%, 6.5%, 7%, 7.5%, 8%, 8.5%, 9%, 9.5%, 10%, 15%, 

20%, 25%, or up to about 30% (by dry weight) of contaminating plant protein. In 
other embodiments, where the glycoprotein is collected from the plant culture 
medium, the plant culture medium in these purified glycoproteins can include at least 
0.001%, 0.005%. 0.1%. 0.5%, 1%, 1.5%, 2%, 2.5%. 3%, 3.5%, 4%, 4.5%, 5%, 5.5%, 

25 6%, 6.5%, 7%, 7.5%, 8%, 8.5%, 9%, 9.5%. 10%, 15%, 20%. 25%. or up to about 

30% (by dry weight) of chemical precursors or non-protein-of-interest chemicals 
within the purified glycoprotein preparation. In some embodiments, isolation and 
purification from the plant host, and where secreted, from the culture medium, results 
in recovery of purified glycoprotein that is free of contaminating plant protein, free of 

30 plant culture medium components, and/or free of both contaminating plant protein and 

plant culture mediimi components. 
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Transformed Plants and Transformed Duckweed Plants and Duckweed Nodule 
Cultures 

Transformation protocols as well as protocols for introducing nucleotide 
sequences into plants may vary depending on the type of plant or plant cell or nodule, 
S that is, monocot or dicot, targeted for transformation. Suitable methods of 

introducing nucleotide sequences into plants or plant cells or nodules include 
microinjection (Crossway et al. (1986) Biotechniques 4:320-334), electroporation 
(Riggs et al. (1986) Proc, Natl Acad, ScL USA 83:5602-5606), Agrobacterium- 
mediated transformation (U S. Patent Nos. 5,563,055 and 5,981,840, both of which 

10 are herein incorporated by reference), direct gene transfer (Paszkowski et ai (1984) 

EMBOJ, 3:2717-2722), ballistic particle acceleration (see, e.g., U.S. Patent Nos. 
4,945,050; 5,879,918; 5,886,244; and 5,932,782 (each of which is herein incorporated 
by reference); and Tomes et al. (1995) "Direct DNA Transfer into Intact Plant Cells 
via Microprojectile Bombardment," in Plant Cell Tissue, and Organ Culture: 

1 5 Fundamental Methods, ed. Gamborg and Phillips (Springer-Verlag, Berlin); McCabe 

et al (1988) Biotechnology 6:923-926). The cells that have been transformed may be 
grown into plants in accordance with conventional ways. See, for example, 
McCormick et al (1986) Plant Cell Reports 5:81-84. 

The stably transformed duckweed utilized in this invention can be obtained by 

20 any method known in the art. In one embodiment, the stably transformed duckweed 

is obtained by one of the gene transfer methods disclosed in U.S. Patent No. 
6,040,498 to Stomp et al, herein incorporated by reference. These methods include 
gene transfer by ballistic bombardment with microprojectiles coated with a nucleic 
acid comprising the nucleotide sequence of interest, gene transfer by electroporation, 

25 and gene transfer mediated by Agrobacterium comprising a vector comprising the 

nucleotide sequence of interest. In one embodiment, the stably transformed 
duckweed is obtained via any one of the Agrobacterium-iiicA\2XQd methods disclosed 
in U.S. Patent No. 6,040,498 to Stomp et al The Agrobacterium used is 
Agrobacterium tumefaciens or Agrobacterium rhizogenes. 

30 It is preferred that the stably transformed duckweed plants utilized in these 

methods exhibit normal morphology and are fertile by sexual reproduction. 
Preferably, transformed plants of the present invention contain a single copy of the 
transferred nucleic acid, and the transferred nucleic acid has no notable 
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rearrangements therein. Also preferred are duckweed plants in which the transferred 
nucleic acid is present in low copy numbers (i.e., no more than five copies, 
alternately, no more than three copies, as a ftirther alternative, fewer than three copies 
of the nucleic acid per transformed cell). 
5 The following examples are offered by way of illustration and not by way of 

limitation. 



EXPERIMENTAL 
Lemna, a small aquatic plant, is a scalable and economically attractive 

10 expression platform for the manufacture of therapeutic proteins free of human 

pathogens and with a clear path towards regulatory approval. The Lemna expression 
system (LEX System ) enables rapid clonal expansion of transgenic plants, secretion 
of transgenic proteins, high protein yields, ease of containment that is comparable to 
mammalian cell culture systems such as CHO cells, and has the additional advantage 

1 5 of low operating and capital costs (Gasdaska et al (2003) Bioprocessing 7. 50-56). In 

addition, this plant expression system offers the advantage of high protein yields (in 
the range of 6-8% of the total soluble protein (TSP)). These expression levels, in 
combination with Lemna *s high protein content and fast growth rate (36 hr doubling 
time), enable production of >lg of mAb per kg biomass in a robust and well- 

20 controlled format. 

The following examples demonstrate how humanization of the glycosylation 
profile of a mAb was accomplished by coexpression of the mAb with an interference 
RNA (RNAi) construct targeting the endogenous expression of al ,3- 
fucosyltransferase and pi,2-xylosyltransferase genes. The resultant mAb contained a 

25 single major N-glycan species (>95%) devoid of the plant specific a- 1 ,3 -linked 

fucose and p-1 ,2-linked xylose sugars. In receptor binding assays, this glycan 
optimized mAb exhibited enhanced effector cell receptor binding activity when 
compared to mAb produced in wild-type Lemna having the native glycosylation 
machinery and mAb produced in CHO cells. 

30 

Example 1 : Isolation of Lemna minor Proteins Involved in A^-Glvcosvlation of 

Proteins 

In order to generate recombinant proteins with remodeled A^-glycan, alpha 1-3 

35 fucosyltransferase and (31-2 xylosyltransferase were selected as targets for RNAi gene 

- Ill - 
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silencing in L minor. Initial results from cDNA sequencing efforts indicated that two 
or more isoforms were present for each of the target genes. Sequence homology 
between the isoforms was determined to be between 90% and 95%. Full length 
cDNA sequences for both target genes were retrieved and characterized. The fiiU- 
5 length cDNA sequence, including 5 - and 3 -UTR, for L minor a 1-3 

fucosyl transferase (FucT) is set forth in Figure 1; see also SEQ ID N0:1 (open 
reading frame set forth in SEQ ID N0:2). The predicted amino acid sequence 
encoded thereby is set forth in SEQ ID N0:3. The encoded protein shares some 
similarity with other FucTs from other higher plants. See Figure 2. For example, the 

10 L minor FucT sequence shares approximately 50. 1% sequence identity with the 

Arabidopsis thaliana FucT shown in Figure 2. 

The full-length cDNA sequence, including 5'- and 3*-UTR, for Z. minor pi -2 
xylosyltransferase (XylT) (isoform #1) is set forth in Figure 3; see also SEQ ID N0:4 
(ORF set forth in SEQ ID N0:5). The predicted amino acid sequence encoded 

15 thereby is set forth in SEQ ID NO: 6. The encoded protein shares some similarity with 

other XylTs from other higher plants. See Figure 4. For example, the L minor XylT 
shares approximately 56.4% sequence identity with the Arabidopsis thaliana XylT 
shown in Figure 4, A partial-length cDNA sequence, including 3 -UTR, for L minor 
pi-2 xylosyltransferase (XylT) (isoform #2) is set forth in Figure 3 1; see also SEQ ID 

20 NO: 19 (ORF set forth in SEQ ID NO:20). The predicted amino acid sequence 

encoded thereby is set forth in SEQ ID N0:21 . The partial-length XylT isoform #2 
shares high sequence identity with the corresponding region of the fiill-length XylT 
isoform #1, as can be seen from the alignment shown in Figure 32. 

25 Example 2: RNAi Inhibition of Expression of L. minor FucT and XvlT 

Several RNAi strategies were undertaken to inhibit expression of the L minor 
FucT and XylT isoforms. Figures 5-7, 33, and 34 outline these strategies. Figures 8- 
13 show maps of the various constructs that were made to achieve the desired 
30 knockout of expression of these two genes. A number of transgenic lines comprising 

the various knockout RNAi constructs were generated using standard transformation 
protocols described herein above. 

The test antibody, designated herein as mAbI, was expressed in wild-type 
Lemna having the native glycosylation machinery, and transgenic Lemna lines 
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expressing RNAi constructs designed to inhibit expression of L, minor XylT and 
FucT isoforms. Generally, three binary vectors were constructed for expression of 
mAbI in the Lemna system. Expression vector mAblOl contained codon optimized 
genes encoding heavy (H) and light (L) chains of mAbI; vector mAbT04 contained 
S codon optimized genes encoding mAbI H and L chains and a chimeric RNAi 

construct targeting expression of both XylT and FucT isoforms; and vector mAblOS 
contained codon optimized genes encoding mAbI H and L chains, a single-gene 
RNAi construct targeting FucT gene expression, and a single-gene RNAi construct 
targeting XylT gene expression. Independent transgenic lines were generated for the 

10 mABIOl , mAbl04, and mAblOS expression vectors. 

Optimized genes for mAbI H and L chains were designed to have Lemna 
preferred codon usage (63% - 67% GC content) and contain the rice a-amylase signal 
sequence (GenBank M24286) fused to the 5' end of their coding sequences. 
Restriction endonuclease sites were added for cloning into Agrobacierium binary 

15 vectore (EcoRI (50 / Sad (3'), H-chain) and (Sail (50 / HindHI (3'), L-chain). 

For the XF02 data presented in Figures 15-17 and the mAbI04 data presented 
in Figures 22-24 and 26, described herein below, the RNAi strategy for inhibiting 
expression of the L, minor FucT and XylT isoforms employed the chimeric RNAi 
20 design shown in Figure 34. For the mAbI05 data presented in Figure 27, described 

herein below, the RNAi strategy for inhibiting expression of the L. minor FucT and 
XylT isoforms employed a double knockout of these genes using a combination of the 
single gene RNAi designs shown in Figure 5 (FucT RNAi design) and Figure 33 
(XylT RNAi design). 

25 Independent expression cassettes containing promoter, gene of interest, and 

Nos terminator were created for the optimized mAbl H and L chains and the single- 
gene or chimeric RNAi. Expression cassettes were cloned into a modification of the 
Agrobacterium binary vector pBMSP3 (obtained from Dr. Stan Gelvin, Purdue 
University) with the appropriate restriction sites. Depending upon the expression 

30 cassette, the L chain was fused to either the modified chimeric octopine and 

mannopine synthase promoter with Lemna gibba 5* RbcS leader (mAblOl, Figure 14) 
or the high expression, constitutive Lemna minor polyubiquitin promoter (LmUbq) 
(mAbI04, Figure 12; mAbI05; Figure 13). The H-chain was fused to the modified 
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chimeric octopine and mannopine synthase promoter with Lemna gibba 5* RbcS 
leader (mAbI04, mAblOS, and mAblOl). The chimeric RNAi cassette, taken from 
plasmid XF02 in T7-4, was fused to the high expression, constitutive Spirodela 
polyrhiza polyubiquitin promoter (SpUbq). The single-gene RNAi cassette for 
5 expression of the FucT inhibitory sequence was driven by the SpUbq promoter; and 

the single gene RNAi cassette for expression of the XylT inhibitory sequence was 
driven by an operably linked expression control element comprising the Lemna 
aequinoctialis ubiquitin promoter pins 5' UTR (LaUbq promoter). The H, L, and 
chimeric RNAi expression cassettes were cloned into the modified pBMSP3 binary 
10 vector in tandem orientation creating plasmid mAbI04. The H, L, and single-gene 

RNAi expression cassettes targeting FucT and XylT expression were cloned into the 
modified pBMSP3 binary vector creating plasmid mAblOS. The H and L expression 
cassettes were cloned into the modified pBMSP3 binary vector creating plasmid 
mAblOl. 

1 5 Though any transformation protocol can be used as noted herein above, in 

some embodiments, the transformation protocol was as follows. Using 
Agrobacterium tumefaciens C58Z707, a disarmed, broad host range C58 strain 
(Hepburn et ai (1985) J. Gen. Microbiol. 737:2961-2969), transgenic plants 
representing individual clonal lines were generated from rapidly growing Lemna 

20 minor nodules according to the procedure of Yamamoto et al (2001) In Vitro Cell 

Dev, Biol. Plant 37:349-353. For transgenic screening, individual clonal lines were 
preconditioned for 1 week at 150 to 200 jxmol m-2s-2 in vented plant growth vessels 
containing SH media (Schenk and Hildebrandt (1972) Can. J. Botany 50:199-204) 
without sucrose. Fifteen to twenty preconditioned fronds were then placed into 

25 vented containers containing fresh SH media, and allowed to^row for two weeks. 

Tissue and media samples from each line were frozen and stored at -70**C. 

A MALDI-TOF assay was developed to measure L minor P-1,2- 
xylosyltransferase (XylT) and a-l,3-fucosyltransferase (FucT) activities (Example 3 
below). 

30 Figures 15-17 represent primary screening data for the XF02, mAbn04, and 

mAblOS plants lines using the aforementioned assay. In this assay, WT (wild-type) 
represents the FucT and XylT activity in wild-type plants while BWT (boiled wild- 
type) represents their activity in boiled plant extracts. Boiled wild-type (BWT) plant 
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extracts are representative of plant material in which FucT and XylT activity has been 
deactivated. This data set shows that several plant lines from each construct have a 
reduced level of FucT and XylT activity compared with wild-type plant lines (WT) 
and a comparable level of activity with boiled wild-type samples (BWT). 
5 Specifically, primary screening data for transgenic RNAi L minor plant lines 

comprising the XF02 construct of Figure 10 are shown in Figures 15 and 16. The 
XF02 construct expresses a chimeric RNAi molecule that targets expression of both 
the L minor FucT and XylT proteins, including the various isoforms of the respective 
proteins. 

10 Figure 17 shows primary screening data for transgenic RNAi L minor plant 

lines comprising the mAbI04 construct of Figure 12 and mAblOS construct of Figure 
13. 

Example 3: MALDI-TOF Assay for A^-glvcan P-l,2-xvlQSvltransferase TXvlT) and g- 
15 1.3-fucosvltransferase (FucT) Activity 

The following modified MALDI-TOF assay was used to determine XylT and 
FucT activity in the transgenic plants described in Example 2 above. 

Materials 

20 

HOMOGENIZATION BUFFER: 50 mM HEPES, pH 7.5, 0.25 M sucrose. 2 mM 

EDTA. 1 mM DTT. 

REACTION BUFFER: 0. 1 M Mes, pH 7.0, 10 mM MnCb, 0. 1% (v/v) 

25 Triton X- 100. 

URIDINE-5'-DIPH0SPH0-i>XYL0SE (UDP-Xyl) 

GUAN0SINE-5'-DIPH0SPH0-L-FUC0SE(GDP-Fuc) 

30 

iV-ACETYLGLUCOSAMINE 

POLYETHYLENE GLYCOL (PEG) MIXTURE 1000-3000 (10 mg/mL PEG 1000, 
2000, and 3000 (4:5:6 ratio) mixed 4: 1 with 2 mg/mL sodium iodide). 

35 

[Glu^]-FIBRINOPEPTIDE B (GFP), HUMAN (1 pmol/pL in water) 
DABSYLATED, TETRAPEPTIDE, AT-GLYCAN ACCEPTOR (EMD Biosciences) 
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CHCA (a-CYANO-4-HYDROXYCINNAMlC ACID) MATRIX (10 mg in 50% 

[v/v] acetonitrile, 0.05% [v/v] trifluoroacetic 
acid). 

5 

Microsome preparation 

L, minor tissue (100 mg) was ground in 1 mL of cold homogenization buffer 
in a bead mill at 5x speed for 40 s. The homogenate was spun at 1 ,000g for 5 min, 
4°C. The supernatant was removed and spun at 18,000g for 2 h, 4°C. The 
10 supernatant was then discarded. The pellet was resuspended in 20 ^L of cold reaction 

buffer and kept on ice or stored at -80^C until use. 



Reaction conditions 

15 The reaction mix contains 125 mM //-acetylglucosamine, 6.25 mM UDP-Xyl, 

6.25 mM GDP'Fuc, 12.5 mM MnCk, and 1.5 nmol of dabsylated, tetrapeptide A^- 
glycan acceptor. Microsomes (4 (xL) were added to the reaction mix to start the 
reaction. The reaction was incubated for 30 min at room temperature, and 90 min at 
37°C. The reaction was terminated by centrifugation at 18,000g for 1 min and 

20 incubation at 4°C. 



MALDI-TOF analysis 

A portion of the supernatant from each reaction (0.5 \ih) was mixed with 0.5 
of CHCA matrix on a MALDI target plate and allowed to dry. The MALDI 
25 instrument was set to reflectron positive ion mode and calibrated with PEG 1000- 

3000. Combined MS spectra (--200 shots) were taken from 1500-2500 Da using 0.5 
pmol GFP as the lock mass. Ion counts of the reference peak (m/z = 2222.865) 
should be above 400, Ion counts of the XylT and FucT products (m/z = 2192,854 and 
2206.870. respectively) were normaUzed to the reference peak and the protein 
30 concentration of the microsome fraction. 

Example 4: Effect of RNAi Inhibition of Expression of L minor FucT and XvlT on 
Glvcosvlarion Profile of Monoclonal Antibodies 

35 Monoclonal antibodies produced by wild-type (i.e., FucT and XylT expression 

not silenced) L. minor comprising the mAblOl construct (see Figure 14) and L. minor 
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lines transgenic for the mAbI04 construct (see Figure 12) or mAblOS construct (see 
Figure 13) were analyzed for their A^-glycosylation profile. The following procedures 
were used. 

5 Purification of mAb from Lemna. 

Plant tissue was homogenized with 50mM Sodium Phosphate, 0.3M Sodium 
Chloride, and lOmM EDTA at pH 7.2 using a Silverson High Shear Mixer at a tissue: 
buffer ratio of 1 :8. The homogenate was acidified to pH 4.5 with IM Citric Acid, and 
centrifuged at 7,500 x g for 30 minutes at 4®C. The supernatant was filtered through a 

10 0.22nm filter and loaded directly on mAbSelect SuRe resin (GE Healthcare) 

equilibrated with a solution containing SOmM Sodium Phosphate, 0.3M Sodium 
Chloride, and lOmM EDTA, pH 7,2. After loading, the column was washed to 
baseline with the equilibration buffer followed by an intermediate wash with 5 
column volumes of 0. IM Sodium Acetate, pH 5.0, and finally, bound antibody was 

15 eluted with 10 column volumes of 0.1 M Sodium Acetate, pH 3.0. The eluate was 

immediately neutralized with 2M Tris base. 

Purification of ^-linked glycans. 

Protein A-purified monoclonal antibodies (1 mg) from wild-type and RNAi L 

20 minor plant lines were dialyzed extensively against water and lyophilized to dryness. 

Samples were resuspended in 100 |iL of 5% (v/v) formic acid, brought to 0.05 mg/mL 
pepsin, and incubated at 37°C overnight. The samples were heat inactivated at 95°C 
for 10 min and dried. Pepsin digests were resuspended in 100 ^iL of 100 mM sodium 
acetate, pH 5.0 and incubated with 1 mU of N-glycosidase A at 37''C overnight. The 

25 released AZ-glycans were isolated using 4 cc Carbograph SPE columns according to 

Packer et al (1998) Glycoconj. J. 15: 731-741, and dried. 

Dried A^-glycans were further purified using 1 cc Waters Oasis MCX 
cartridges. Columns were prepared by washing with 3 column volumes of methanol 
followed by 3 column volumes of 5% (v/v) formic acid. A^-glycans, resuspended in 1 

30 mL of 5% (v/v) formic acid, were loaded onto the prepared columns. The unbound 

fraction as well as 2 additional column volume washes of 5% (v/v) formic acid were 
collected, pooled and dried. 
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Derivatization of oligosaccharides with l-aminobenzoic acid (2-AA), 

Purified /V-glycans or maltooligosacxjharides were labeled with 2-AA and 
purified using 1 cc Waters Oasis HLB cartridges according to Anumula and Dhume 
(1998) Glycobiology 8: 685-694. Labeled A/^glycans and maltooligosaccharides were 
5 resuspended in 50 |iL of water and analyzed by MALDI-TOF MS and normal phase 

(NP) HPLC-QTOF MS. 



MALDI-TOF Mass Spectrometry, 
10 MALDI-TOF MS was conducted using a Waters MALDI Micro MX 

(Millford, MA). 2-AA labeled //-glycans (0.5 ^iL) were properly diluted with water, 
mixed with 0.5 ^L of 10 mg/mL DHB matrix in 70% (v/v) acetonitrile, spotted onto a 
target plate and analyzed in negative reflectron mode. 

1 5 NP-HPLC-Q-TOF MS analysis of 2-AA labeled N-glycans. 

2-AA labeled A^-glycans or maltooligosaccharides were brought to 80% (v/v) 

acetonitrile and separated on a Waters 2695 HPLC system fitted with a TSK-Gel 

Amide-80 (2 mm x 25 cm, 5 ^m) column (Tosoh Biosciences, Montgomeryville, PA). 

2-AA labeled carbohydrates were detected and analyzed by fluorescence (230 nm 
20 excitation, 425 nm emission) using a Waters 2475 fluorescence detector and a Waters 

Q-TOF API US quadropole-time of flight (Q-TOF) mass spectrometer (Millford, 

MA) fitted in-line with the HPLC system. 

Separations were conducted at 0.2 mlVmin, 40°C, using 10 mM ammonium 

acetate, pH 7.3 (solvent A) and 10 mM ammonium acetate, pH 7.3, 80% (v/v) 
25 acetonitrile (solvent B). Sample elution was carried out at 0% A isocratic for 5 min, 

followed by a linear increase to 10% A at 8 min, and a linear increase to 30% A at 48 

min. The column was washed with 100% A for 15 min and equilibrated at 0% A for 

15 min prior to the next injection. 

Q-TOF analysis was conducted in negative ion mode with source and 
30 desolvation temperatures of lOO^C and 300^C, respectively, and capillary and cone 

voltages of 2,100 and 30 V, respectively. Mass spectra shown are the result of 

combining >50 individual scans per labeled //-glycan. 



-118- 



wo 2007/084922 



PCTAJS2007/060642 



RP-HPLC-Q'TOF MS analysis of intact IgG, 

Protein A purified IgG's (50 ^ig) were desalted using the Waters 2695 HPLC 
system fitted with a Poros Rl-10 column (2 mm x 30 mm; Applied Biosystems). 
IgG's were detected and analyzed using a Waters 2487 dual wavelength UV detector 
5 (280 nm) and the Waters Q-TOF API US. Separations were conducted at 0. 1 5 

mL/min, 60°C, using 0.05% (v/v) trifluoroacetic acid (TFA; solvent A) and 0.05% 
(v/v)TFA, 80% (v/v) acetonitrile (solvent B). Sample elution was carried out using a 
linear increase from 30 to 50% B for 5 min, an increase to 80% B for 5 min. The 
solvent ratio remained at 80% B for an additional 4 min, followed by a wash with 
10 100% B for 1 min and equilibration of the column with 30% B for 15 min prior to the 

next run. 

Q-TOF analysis was conducted in positive ion mode with source and 
desolvation temperatures of IOO°C and 300°C, respectively, and capillary and cone 
voltages of 3.0 and 60 V, respectively. Data are the result of combining >100 
1 5 individual scans and deconvolution to the parent mass spectrum using MaxEnt 1 . 

See also Triguero et al. (2005) Plant Biotechnol J. 3: 449-457; Takahashi et 
al {\99%) Anal Biochem, 255: 183-187; Dillon etal (2004) J, Chromatogr. A, 1053: 
299-305. 

20 Results, 

Figure 18 shows the structure and molecular weight of derivatized wild-type 
L minor monoclonal antibody /V-glycans. 

Figure 19 shows that the wild-type mAblOl construct (shown in Figure 15) 
providing for expression of the mAbI monoclonal IgGl antibody in L minor, without 

25 RNAi suppression of L minor FucT and XylT, produces an N-glycosylation profile 

with three major N-glycan species, including one species having the pi,2-linked 
xylose and one species having both the pi,2-linked xylose and core al,3-linked 
fucose residues; this profile is confirmed with liquid chromatography mass 
spectrometry (LC-MS) (Figure 20) and MALDI (Figure 21) analysis. 

30 Figure 22 shows an overlay of the relative amounts of the various iV-glycan 

species of the mAbI produced in the wild-type L minor line comprising the mAblOl 
construct (no suppression of FucT or XylT) and in the two transgenic L. minor lines 
comprising the mAbI04 construct of Figure 12. Note the enrichment of the GnGn 
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(i.e., GO) glycan species, with no pi,2-linked xylose or core al,3-linked fucose 
residues attached, and the absence of the species having the pi,2-linked xylose or 
both the pl,2-linked xylose and core al,3-linked fucose residues. This profile is 
confirmed with mass spec (LC-MS) (Figure 23) and MALDI (Figure 24) analysis. 
S Figure 25 shows intact mass analysis of the mAbI compositions produced in 

wild-type L minor (line 20) comprising the mAblOl construct. When XylT and FucT 
expression are not suppressed in L minor, the recombinantly produced mAbI 
composition is heterogeneous, comprising at least 9 different glycoforms, with the 
GOXF^ glycoform being the predominate species present. Note the very minor peak 

10 representing the GO glycoform. 

Figure 26 shows intact mass analysis of the mAbI compositions produced in 
transgenic L minor (line 15) comprising the mAbI04 construct of Figure 12, When 
XylT and FucT expression are suppressed in L minor using this chimeric RNAi 
construct, the intact mAbI composition is substantially homogeneous for GO 

15 glycans, with only trace amounts of precursor A^-glycans present (represented by the 

GnM and MGn precursor glycan species). In addition, the mAbI composition is 
substantially homogeneous for the GO glycoform, wherein both glycosylation sites are 
occupied by the GO /V-glycan species, with three minor peaks reflecting trace amounts 
of precursor glycoforms (one peak showing mAbI having an Fc region wherein the 

20 Ch2 domain of one heavy chain has a GO glycan species attached to Asn 297, and the 

Ch2 domain of the other heavy chain is unglycosylated; another peak showing mAbI 
having an Fc region wherein the Ch2 domain of one heavy chain has a GO glycan 
species attached to Asn 297, and the Ch2 domain of the other heavy chain has the 
GnM or MGn precursor glycan attached to Asn 297; and another peak showing mAbI 

25 having an Fc region wherein the Asn 297 glycosylation site on each of the Ch2 

domains has a GO glycan species attached, with a third GO glycan species attached to 
an additional glycosylation site within the mAbI structure). 

Figure 27 shows intact mass analysis of the mAbl compositions produced in 
transgenic L minor (line 72) comprising the mAblOS construct of Figure 13, When 

30 XylT and FucT expression are suppressed in L minor using this construct, the intact 

mAbl composition is substantially homogeneous for GO A^-glycans, with only trace 
amounts of precursor A^-glycan species present (represented by the GnM and MGn 
precursor glycan species). In addition, the mAbl composition is substantially 
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homogeneous (at least 90%) for the GO glycoform, with the same three minor peaks 
reflecting precursor glycoforms as obtained with the mAbI04 construct. 

The receptor binding activity of the mAbI produced in the wild-type Lemna 
lines comprising the mAblOl construct (i.e., without inhibition of XylT and FucT 
5 expression) and the transgenic Lemna lines comprising the mAbI04 or mAblOS 

construct (i.e., with XylT and FucT expression inhibited) was compared to the 
receptor binding activity of the mAbI produced in mammalian cell lines (CHO and 
SP2/0). 

Binding to FcFcyRIIIa on freshly isolated human NK cells was assessed for 

10 the various mAbl products. Control data collected for CHO-derived mAbl and 

SP2/0-derived mAbl is shown in Figure 35. Test data collected for wild-type Lemna- 
produced mAbl having the normal plant iV-glycan profile are designated as mAblOl- 
15 and mabI01-20, wherein the mAbl product has //-linked glycans that include 
a(l,3)-fucose residues (see Figures 36 and 37). Test data collected for transgenic 

15 iem«a-derived mAbl having an optimized iV-glycan profile (OPT) obtained with 

gene-silencing RNAi constructs that target expression of al,3-fucosyltransferase are 
designated as mAbI05-72, mAbI05-74, niAbI04-24, and mAbI04-15 (see Figures 36 
and 37), wherein the mAbl product has jV-linked glycans that are devoid of a(l,3)- 
fucose residues. Data comparing binding efficacy of mAblO 1-15, mAblO 1 -20, mAbl 

20 SP2/0, mAbI04-l 5, mAbI04-24, mAbI05-72, and mAbI05-74 to recombinant mouse 

FcyRIV, a receptor that is sensitive to IgG fucose levels and which served as a 
surrogate for human FcyRIIla, is shown in Figure 38. 

These data demonstrate that the transgenic Lem/ja-derived mAbl product 
having the optimized glycan profile (OPT) shows enhanced binding to FcyRIIla on 

25 freshly isolated human NK cells (enhanced about 20 to 50-fold) as well as enhanced 

binding to recombinant mouse FcyRIV (enhanced about 10- fold) as compared to the 
wild-type Lemna-dmy^A mAbl product. 

Example 6: Production of Anti-CD30 Monoclonal Antibodv Having Improved 
30 Receptor Binding and Increased ADCC Activity 

This example outlines the expression of human anti-CD30 mAbs in Lemna, 
Optimization of anti-CD30 mAb glycosylation was accomplished by co-expression 
with an RNAi construct targeting the endogenous expression of a-1,3- 
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fiicosyltransferase (FucT) and p-l,2-xylosyltransferase pCylT) genes in a manner 
similar to that noted in the examples above for mAbl. The resultant anti-CD30 mAb 
produced in Lemna having its native glycosylation machinery engineered to suppress 
FucT and XylT expression contained a single major A/-glycan species without any 
5 trace of plant-specific TV-glycans. In addition to the N-^ycm homogeneity, glyco- 

optimized anti-CD30 mAbs were also shown to have enhanced antibody-dependent 
cell-mediated cytotoxicity (ADCC) and effector cell receptor binding activity when 
compared to CHO-expressed anti-CD30 mAbs, 



10 METHODS 

Strains and Reagents, 

Novablue competent Escherichia coli cells were used for all recombinant 
DNA work (EMD Biosciences, San Diego, CA). Restriction endonucleases and DNA 
modification enzymes were obtained from New England Biolabs (Ipswich, MA). 

15 Oligonucleotides were obtained from Integrated DNA technologies (Coralville, LA). 

Waters Oasis HLB and MCX columns (1 cc), 2,5-dihydroxybenzoic acid (DHB), and 
a-cyano-4-hydroxycinnamic acid (CHCA) were from Waters Corporation (Milford, 
MA). Purified dabsylated, tetrapeptide, GnGn A^-glycan acceptors (GnGn-dabsyl- 
peptide) and //-glycosidase A were from EMD Biosciences. Carbograph SPE 

20 columns (4 cc) were from Grace Davidson Discovery Sciences (Deerfield, IL). 

Uridine-5'-diphospho-D-xylose (UDP-Xyl) was purchased from Carbosource 
Services (Athens, GA). Acetonitrile (Optima grade) was fi'om Fisher Scientific 
(Summerville, NY). Ammonium acetate was from MP Biochemicals (Irvine, CA). 
Maltooligosaccharides (MD6-1) were from V-Labs Inc. (Covington, CA). 

25 Monosaccharide standards were from Dionex (Sunnyvale, CA). BATDA 

(bis(acetoxymethyl)2,2*:6',2"-terpyridine-6,6"-dicarboxylate) and Europium solution 
were from Perkin-Elmer (Wellesley, MA). Guanosine-5*-diphospho-L-fticose (GDP- 
Fuc), A^-acetylglucosamine (GlcNAc), 2-aminobenzoic acid (2-AA) and all other 
materials were from Sigma (St. Louis, MO). 

30 

Construction of mAb and RNAi expression vectors. 

The heavy (H) and light (L) chain variable region cDNA sequences of fiilly 
human mAbl kappa antibody MDX-060 derived from a transgenic Medarex HuMAb- 
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Mouse® (Borchmann et ai (2003) Blood 102:3737-3742) were determined and the 
full length MDX-060 human mAbl kappa antibody was produced recombinantly by a 
Chinese hamster ovary cell line, CHO DG44 (Urlaub et al (1986) Cell Mol. Genet. 
1 2:555-566), using standard techniques. Optimized genes for H and L chains were 
5 designed to have lemw^-preferred codon usage (63%-67% GC content) and contain 

the rice a-amylase signal sequence (GenBank M24286) fused to the 5' end of their 
coding sequences. Restriction endonuclease sites were added for cloning into 
Agrobacterium binary vectors (EcoRT (5') / Sad (3')» H-chain) and (Sail (5') / Hindin 
(3*), L-chain). Synthetic genes were constructed and provided by Picoscript (Houston, 
10 TX). 

A chimeric hairpin RNA (see Figure 34) was designed to target silencing of 
endogenous Lemna genes encoding a-l,3-fucosyltransferase (based on the coding 
sequence for L minor FucT isoform #1, set forth in SEQ ID N0:2; see also GenBank 
DQ789145) and P-l,2-xylosyltransferase (based on the coding sequence ioxL minor 

15 XylT isoform #2, nt 1-1275 of SEQ ID N0:19; set forth in SEQ ID NO:20; see also 

GenBank DQ789146). The chimeric FucT + XylT hairpin RNA was designed to have 
602 bp of double stranded FucT sequence, 626 bp of double stranded XylT sequence, 
and 500 bp of spacer sequence. The sense strand portion of the hairpin RNA cassette 
encompasses the FucT forward fragment (nt 12-613 of SEQ ID N0:2; equivalent to nt 

20 254-855 of SEQ ID NO: 1) and XylT forward fragment (nt 1-626 of SEQ ID NO: 19), 

a spacer sequence (nt 627-1 126 of SEQ ID NO: 19). The antisense strand portion of 
the hairpin RNA encompasses the XylT reverse fragment (antisense version of nt 1- 
626 SEQ ID NO: 19) and FucT reverse fragment (antisense version of nt 12-613 of 
SEQ ID N0:2 or nt 254-855 of SEQ ID NO: 1). The chimeric hairpin RNA was 

25 constructed by PGR amplifying FucT and XylT forward and reverse gene fragments 

from Lemna minor (8627) cDNA and sequentially cloning them into pT7blue (EMD 
Biosciences) creating plasmid XF02 in T7-4. The FucT forward gene fragment was 
amplified with DNA primers BLX 686 (5'-ATGGTCGACTGCTGCTGGTGCTC 
TCAAC-3') (SEQ ID NO:22) and BLX690 (5'- ATGTCTAGAATG 

30 CAGCAGCAAGTGCACC-3') (SEQ ID NO:23) producing a 620 bp product with 

terminal Sail (5*) and Xbal (3*) cloning sites. The XylT forward gene fragment was 
amplified with DNA primers BLX 700 (5'-ATGACTAGTTGC 
GAAGCCTACTTCGGCAACAGC30 (SEQ ID NO:24) and BLX694 (5'- 

- 123- 



wo 2007/084922 



PCT/US2007/060642 



ATGGGATCCGAATCTCAAGA ACAACTGTCG-30 (SEQ ID NO:25) producing a 
1 144 bp product with terminal Spel (5') and BamHI (3') cloning sites. The XylT 
reverse gene fragment was amplified with DNA primers BLX 695 (5'- 
ATGGGTACCTGCGAAGCCTACTTCGGCAA CAGC-30 (SEQ ID NO:26) and 
5 BLX696 (5'- ATGGGA TCCACTGGCTGGGAGAAGTTCTT-3') (SEQ ID NO:27) 

producing a 644 bp product with terminal BamHI (5') and Kpnl (3') cloning sites. 
The FucT reverse gene fragment was amplified with DNA primers BLX 69 1 (5'- 
ATGGAGCTCTGCTGCTGGTGCT CTCAAC-3') (SEQ ID NO:28) and BLX692 (5'- 
ATGGGTACCATGCAGCAGCAAGTGCACC-3') (SEQ ID NO:29) producing a 620 

10 bp product with terminal Kpnl (5') and Sad (3') cloning sites. 

Independent expression cassettes containing promoter, gene of interest, and 
Nos terminator were created for the optimized MDX-060 H and L chains and the 
chimeric RNAi. Expression cassettes were cloned into a modification of the 
Agrobacterium binary vector pBMSP3 (obtained from Dr. Stan Gelvin, Purdue 

15 University) with the appropriate restriction sites. The H chain was fused to the 

modified chimeric octopine and mannopine synthase promoter with Lemna gibba 5' 
RbcS leader (Gasdaska et ai (2003) Bioprocessing J, 50-56). The L-chain was fused 
to the high expression, constitutive Lemna minor polyubiquitin promoter (LmUbq). 
The chimeric RNAi cassette, taken from plasmid XF02 in T7-4, was fused to the high 

20 expression, constitutive Spirodela polyrhiza polyubiquitin promoter (SpUbq). The 

three expression cassettes were cloned into the modified pBMSP3 binary vector in 
tandem orientation creating plasmid MDXA04. 



Transformation and plant line screening, 

25 Using Agrobacterium tumefaciens C58Z707 (Hepburn et al. ( 1 985) J. Gen, 

Microbiol 131:2961-2969), transgenic plants representing individual clonal lines 
were generated from rapidly growing Lemna minor nodules according to the 
procedure of Yamamoto et al. (Y amamoto et al (2001) In vitro Cell Dev. Biol 37). 
For transgenic screening, individual clonal lines were preconditioned for 1 week at 

30 150 to 200 [xmo\ m'h'^ in vented plant growth vessels containing SH media (Schenk 

and Hildenbrandt (1972) Can. J. Botany 50:199-204) without sucrose. Fifteen to 
twenty preconditioned fronds were then placed into vented containers containing fresh 
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SH media, and allowed to grow for two weeks. Tissue and media samples from each 
line were frozen and stored at -70°C. 



EUSA analysis of mAb produced in Lemna, 

5 Lemna tissue (100 mg) was homogenized using a FastPrep FP120 bead mill 

(Thermo Electron Corporation). Supematants were diluted to 1 ^ig/mL and assayed 
using the IgG Quantitation ELISA kit (Bethyl Laboratories). For the assay, microtiter 
plates were coated with goat anti-human IgG at a concentration of 10 ^g/mL, and 
mAb was detected by horseradish peroxidase (HRP)-conjugated goat anti-human IgG 

10 diluted 1: 100,000. Standard curves were created with Human Reference IgG supplied 

with the EUSA kit. The sensitivity of the EUSA was 7.8 ng/mL. All samples were 
analyzed in duplicate. 



Preparation of Lemna microsomal membranes and assaying for core ^-1,2- 

1 5 xylosyltransferase and a- IJ-JUcosy transferase activities. 

Lemna tissue (100 mg) from each line was homogenized in 1 mL of cold 
homogenization buffer (50 mM 4-[2-hydroxyethyl]-l-piperazineethanesulfonic acid 
[HEPES], pH 7.5, 0.25 M sucrose, 2 mM ethylenediaminetetraacetic acid [EDTA], 1 
mM 1 ,4-dithiothreitol [DTT]) for 40 s in a FastPrep FP120 bead mill (Thermo 

20 Electron Corporation, Waltham, MA). The homogenate was centrifiiged at 1,000 g 

for 5 min at 4*'C, The supernatant was removed and centrifiiged at 18,000g for 90 
min at 4°C. The resulting pellet was resuspended in 20 \iL of cold reaction buffer 
(0.1 M 2-[4-morpholino] ethanesulfonic acid [Mes], pH 7.0, 0.1% [v/v] Triton X-100, 
10 mM MnCla) and kept on ice or stored at -SO^'C until use. 

25 Core P- 1 ,2-xylosyltransferase and a- 1 ,3-fiicosyltransferase activities were 

measured simultaneously in 4 ^L of microsomal membranes prepared from each 
RNAi line by incubating with 125 mM GlcNAc, 6.25 mM UDP-Xyl, 6.25 mM GDP- 
Fuc, 12.5 mM MnCb, and 1.5 nmol of GnGn-dabsyl-peptide acceptor for 2 h at 3TC 
as described previously (Leiter fl/. {1999) J. Biol Chem. 274:21830-21839). The 

30 reaction was terminated by a brief centrifijgation and incubation at 4®C and the 

products were analyzed by positive reflectron mode MALDI-TOF MS. 
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Purification ofMDX-060 LEXandLEX^' mAbs. 

Plant tissue was homogenized with 50 niM sodium phosphate, 0.3 M sodium 
chloride, and 10 mM EDTA, pH 7.2 using a Silverson high shear mixer. The 
homogenate was acidified to pH 4.5 with 1 M citric acid, and centrifiiged at 7,500g 
5 for 30 min at 4X. The pH of the supematant was adjusted to pH 7.2 with 2 M Tris, 

prior to filtration using 0.22 ^m filters. The material was loaded directly on 
mAbSelect SuRe protein A resin (GE Healthcare) equilibrated with a solution 
containing 50 mM sodium phosphate, 0.3 M sodium chloride, and 10 mM EDTA, pH 
7.2. After loading, the column was washed to baseline with the equilibration buffer 

10 followed by an intermediate wash with 5 column volumes of 0.1 M sodium acetate, 

pH 5.0. Bound antibody was eluted with 10 column volumes of 0.1 M sodium 
acetate, pH 3.0. The protein A eluate was immediately neutralized with 2 M 2-amino- 
2-[hydroxymethyl]-l,3-propanediol (Tris). For aggregate removal, the protein A 
eluate was adjusted to pH 5.5 and applied to a ceramic hydroxyapatite type I (Bio- 

15 Rad) column equilibrated with 25 mM sodium phosphate, pH 5.5. After washing the 

column with 5 column volumes of equilibration buffer, the antibody was eluted in a 
single step-elution using 0.25 M sodium phosphate, pH 5.5. Fractions containing 
antibody by A280 were pooled and stored at -SO'^C. 

Tissue extract and protein A flow through samples were prepared for SDS- 

20 PAGE under reducing and non-reducing conditions by addition of 2x SDS sample 

buffer ± 5% [v/v] 2-mercaptoethanol. Protein A eluate and hydroxyapatite eluate 
samples were diluted to a protein concentration of 0.5mg/mL followed by addition of 
2x SDS sample buffer ± 5% [v/v] 2-mercaptoethanol. Samples were incubated at 
95°C for 2 minutes prior to electrophoresis using 4-20% Tris-Glycine gradient gels 

25 (Invitrogen, Carlsbad, CA). Markl2 Molecular weight markers (Invitrogen) and a 

MDX-060 reference standard were included on the gels. Gels were stained with 
Colloidal Blue stain. 

Purification of N-linked glycans. 
30 Protein A purified monoclonal antibodies (1 mg) from wild-type and RNAi 

Lemna plant lines were dialyzed extensively against water and lyophilized to dryness. 
Samples were resuspendcd in 100 \xh of 5% (v/v) formic acid, brought to 0.05 mg/ml 
pepsin, and incubated at 37®C overnight. The samples were heat inactivated at 95®C 
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for 15 min and dried. Pepsin digests were resuspended in 100 ^iL of 100 mM sodium 
acetate, pH 5.0 and incubated with 1 mU of A^-glycosidase A at 37°C overnight. The 
released A^-glycans were isolated using 4 cc Carbograph SPE columns (Packer et al 
(1 998) Gfycoconj, J. 1 9:737-747) and dried. 

5 Dried A^-glycans were further purified using 1 cc Waters Oasis MCX 

cartridges. Columns were prepared by washing with 3 column volumes of methanol 
followed by 3 column volumes of 5% (v/v) formic acid. iV-glycans, resuspended in 1 
mL of 5% (v/v) formic acid, were loaded onto the prepared columns. The unbound 
fraction as well as 2 additional column volume washes of 5% (v/v) formic acid were 

10 collected, pooled, and dried. 



Derivatization of oligosaccharides with 2-aminobenzoic acid (2-AA). 

Purified A/-glycans or maltooligosaccharides were labeled with 2-AA and 
purified using 1 cc Waters Oasis HLB cartridges according to Anumula and Dhume, 
15 1998 (Anumula and Dhume (1 998) Glycobiology 8:685-694). Labeled A^-glycans and 

maltooligosaccharides were resuspended in 50 (iL of water and analyzed by negative 
mode MALDI-TOF MS and NP-HPLC-QTOF MS. 



MALDI'TOF Mass Spectrometry, 

20 MALDI-TOF MS was conducted using a Waters MALDI Micro MX 

(Millford, MA). Analysis of p- 1 ,2-xylosyltransferase/a- 1 ,3-fiicosyltransferase 
reaction products was conducted by mixing 0.5 ^L of each reaction supernatant with 
0.5 ^L of 10 mg/mL CHCA in 0.05% (v/v) TFA, 50% (v/v) acetonitrile on a target 
plate. Xylosylated ([M+H]"^ = 2192.85 Da) or fucosylated ([M+H]^ = 2206.87 Da) 

25 GnGn-dabsyl-peptide products were detected in positive reflectron mode. Ton counts 

of 200 combined spectra from each sample were normalized against that of 1,4- 
galactosylated, GnGn-dabsyl-peptide ([M+H]^ = 2222.87 Da) present as a 
contaminant (<5%) in the original GnGn-dabsyl-peptide mixture from EMD 
Biosciences. 

30 2-AA labeled /V-glycans or maltooligosaccharides (0.5 ^L) were diluted with 

water, mixed with 0.5 ^L of 10 mg/ml DHB matrix in 70% (v/v) acetonitrile, spotted 
onto a target plate and analyzed in negative reflectron mode. 
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NP'HPLC-Q-TOF MS analysis ofl-AA labeled N-glycans. 

2-AA labeled A^-glycans or maltooligosaccharides were brought to 80% (v/v) 
acetonitrile and separated on a Waters 2695 HPLC system fitted with a TSK-Gel 
Amide-80 (2 mm x 25 cm, 5 \m) column (Tosoh Biosciences, Montgomeryville, PA). 
5 2-AA labeled carbohydrates were detected and analyzed using a Waters 2475 

fluorescence detector (230 nm excitation, 425 nm emission) and a Waters Q-TOF API 
US quadropole-time of flight (QTOF) mass spectrometer fitted on-line with the HPLC 
system. 

Separations were conducted at 0.2 mlVmin, 40°C, using 10 mM ammonium 
10 acetate, pH 7,3 (solvent A) and 10 mM ammonium acetate, pH 7.3, 80% (v/v) 

acetonitrile (solvent B). Sample elution was carried out at 0% A isocratic for 5 min, 
followed by a linear increase to 10% A at 8 min, and a linear increase to 30% A at 48 
min. The column was washed with 100% A for 15 rain and equilibrated at 0% A for 
IS min prior to the next injection. 
15 QTOF analysis was conducted in negative ion mode with source and 

desolvation temperatures of lOO^C and 300°C, respectively, and capillary and cone 
voltages of 2,100 and 30 V, respectively. Mass spectra shown are the result of 
combining >40 individual scans per labeled A^-glycan. 

20 Monosaccharide analysis by HPAEC-PAD. 

mAb samples (200 ng) were subjected to acid hydrolysis using 2 N TFA at 
lOOX for 3 h. Samples were dried by vacuum centrifugation at ambient temperature 
and reconstituted in 150 |iL water prior to analysis by HPAE-PAD (Dionex). An 
aliquot (25 ^iL) of the reconstituted sample was applied to a CarboPac PA 10 column 

25 (4 x 250 mm) with a pre-column Amino Trap (Dionex). Separation of 

monosaccharides was accomplished with a mobile phase of 3.5 mM KOH, using an 
EG40 eluent generator. Monosaccharide peak identity and relative abundance were 
determined using monosaccharide standards. 

30 Thermal stability of mAb, 

A MicroCal (Northampton, MA) VP-Capillary differential scanning 
calorimetry (DSC) instrument was used to determine thermal stability of glycol- 
optimized and wild-type mAbs. Purified mAb samples were dialyzed in 20 mM 
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NaH2P04, pH 7,4, 150 mM NaCl (PBS) overnight. Thermal denaturation data was 
collected by heating the samples at a concentration of 300 fig/mL from 35 to 95 X at 
a scan rate of PC/min using PBS as the reference buffer. The feedback and gain were 
set to low. The baseline-corrected and normalized data was fit to a non-2-state model 
5 using Origin v7.0 software. 



FcR binding activity of mAb. 

The experiment was conducted using a BIACORE (Biacore AB, Uppsala, 
Sweden) instrument using surface plasmon resonance technology. mAbs, 2 |ig/mL, 

10 were captured on the antigen coated surface (recombinant human CD30). Several 

concentrations of both the Val'^^ and Phe^^* allotypes of FcRylTIa, starting from 6 
^M, were flowed over the captured antibodies for 3 min. The binding signal as a 
function of FcRyllla was fit to a one-site binding model using GraphPad Prism 
(v4.01) software to obtain the Kd values. HBS-EP buffer (10 mM HEPES, 0. 15 M 

15 NaCI. 3 mM EDTA and 0.005% (v/v) P20 at pH 7.4) was used throughout the 

experiment. Binding of the mAbs to buffer or FcRyllla to blank surfaces were used 
as negative controls. 

Assay for antigen binding affinity, 

20 CD30-expressing L540 cells (DSMZ Cell Culture Collection # ACC 72) were 

used as antigen positive cells to assay for binding. Aliquots of 2 x 10^ cells/well were 
incubated for 30 min at 4°C with 100 |iL of primary antibody at the indicated 
concentrations. Cells were washed twice in PBS with 2% (v/v) fetal bovine serum 
(FBS) before addition of goat anti -human mAb, FFTC-labeled secondary antibody 

25 (Jackson ImmunoResearch, West Grove, PA) at 1 :500 dilution in 100 ^iL/well for 30 

min at 4*'C. Cells were washed twice in PBS with 2% (v/v) FBS and assayed by flow 
cytometry using a FACS Calibur instrument (Becton Dickinson, Franklin Lakes, NJ). 
ECso values of MDX-060 CHO, LEX and LEX^' mAb binding to CD30 on L540 
cells were determined from binding curves utilizing GraphPad Prism 3.0 software. 



30 



ADCC assay. 

Human peripheral blood mononuclear effector cells were purified from 
heparinized whole blood by standard FicoU-Paque separation. Cells (2 x 10^) were 
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washed in PBS and sent for genotyping. The remaining effector cells were then 
resuspended at 1 x 10^ cells/mL in RPMI 1640 medium containing 10% (v/v) FBS 
and 50 U/mL of human JL-2 (Research Diagnostics, Concord, MA) and incubated 
overnight at 37°C. The effector cells were washed once in culture medium and 
5 resuspended at 1 x 10^ cells/mL prior to use. L540 target cells at 1 x 10^ cells/mL in 

RPMI 1640 medium containing 10% (v/v) FBS and 5 mM probenecid were labeled 
with 20 ^M BATDA (bis(acetoxymethyl)2,2':6',2"-terpyridine-6,6"-dicarboxylate) 
for 20 min at 37^C. Target cells were washed three times in PBS supplemented with 
20 mM HEPES and 5 mM probenecid, resuspended at 1 x lO^cells/mL and added to 

10 effector cells in 96-well plates (1 x 10^ target cells and 5x10^ effector cells/well) at a 

fmal target to effector ratio of 1:50. Maximal release was obtained by incubation of 
target cells in 3% (v/v) Lysol and spontaneous release obtained by incubation in cell 
culture medium alone. After 1 h incubation at 37^C, 20 of supernatant was 
harvested from each well and added to wells containing 180 of Europium solution. 

15 The reaction was read with a Perkin Elmer Fusion Alpha TRF reader using a 400 jisec 

delay and 330/80, 620/10 excitation and emission filters respectively. The counts per 
second were plotted as a function of antibody concentration and the data was analyzed 
by non-linear regression, sigmoidal dose response (variable slope) using GraphPad 
Prism 3.0 software. The percent specific lysis was calculated as: (experimental 

20 release - spontaneous release)/(maximal release - spontaneous release) x 100. In all 

studies, human mAbl isotype control was included and compared to MDX-060 CHO, 
LEX, and LEX^^* mAbs. Other controls included target and effector cells with no 
mAb, target cells with no effector cells and target and effector cells in the presence of 
3% (v/v) Lysol. 

25 

RESULTS 

Expression of MDX-060 mAb in the LEX system, 

MDX-060 is an anti-CD30 antibody (formally known as 5F1 1) being 
developed for the treatment of Hodgkins lymphoma and anaplastic large cell 
30 lymphoma (Borchmann et al (2003) Blood 102:3737-3742). Two binary vectors 

were constructed for the expression of MDX-060 in the LEX system. Expression 
vector MDXAOl contained codon optimized genes encoding heavy (H) and light (L) 
chains of MDX-060 while vector MDXA04 contained genes encoding H and L chains 
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in addition to a chimeric FucT/XylT RNAi gene (Figure 39). Independent transgenic 
lines were generated for both the MDXAOl (165 lines) and MDXA04 (195 lines) 
expression vectors. For simplicity, MDXAOl derived mAbs (wild-type N- 
glycosylation), and MDXA04 derived mAbs (containing the FucT/XylT RNAi 
5 construct) will be referred to as MDX-060 LEX and MDX-060 LEX^\ respectively, 

in the discussions below. 

Transgenic plant lines were first screened for mAb expression with an IgG 
ELTSA. LEX^* lines with high levels of mAb expression were assayed further for 
FucT and XylT activity. Transferase activities in the majority of the high expressing 

10 MDX-060 LEX^^ lines were reduced to levels of the negative control indicating 

effective silencing in the majority of the assayed lines (Figure 40). MDX-060 LEX^^ 
lines did not exhibit any morphological or growth differences compared to wild-type 
Lemna plants (data not shown). 

Thermal stabilities of the MDX-060 CHO, LEX, and LEX^^mAbs were 

15 determined using differential scanning calorimetry (DSC). All three mAbs exhibited 

similar melting curve kinetics (data not shown) and melting transition point 
temperatures (Table 2 below), further demonstrating the structural integrity of the 
Lemna-producQd MDX-060 LEX and LEX^* mAbs compared to the MDX-060 CHO 
mAb. SDS-PAGE analysis under non-reducing (Figure 41 A) and reducing conditions 

20 (Figure 4 1 B) showed that mAbs from the MDX-060 LEX^' and MDX-060 CHO 

lines had similar protein profiles with the mAb appearing as the major component in 
the protein extract. 



Table 2. Comparison of the thermal stabilities of MDX-060 CHO, MDX-060 LEX, 
25 and glyco-optimized MDX-060 LEX^^ mAbs by differential scanning calorimetry 

(DSC). 



Antibody 


Tml CC) 


Tm2 rC) 


T,„3 (°C) 


MDX-060 CHO 


72 


75 


84 


MDX-060 LEX 


71 


75 


84 


MDX-060 LEX**P' 


72 


76 


84 
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N-glycan structures ofMDX-060 CHO, LEX, and LEX^^' mAbs, 

The Af-glycan profiles of recombinant MDX-060 CHO, MDX-060 LEX. and 
MDX-060 LEX°P* derived mAbs were determined by negative reflectron mode 
MALDT-TOF MS and normal phase (NP) HPLC-QTOF MS. The structures oiN- 
glycans referred to in the following discussion are shown in Figure S3. 

MALDI-TOF MS analysis of A^-glycans from MDX-060 CHO lines indicated 
the presence of four major A^-glycans with m/z values corresponding to 2-AA labeled 
GnGnF^ (nomenclature derived from http://www.proglycan.com), Man5, GnAjsoF*^, 
and AAF^ (Figure 42). NP-HPLC separated the GnAisoF^ A^-glycan into its two 
isoforms (Gal attached to the a-l,6-Man or a-l,3-Man arm) bringing the total number 
of major A^-glycans found on MDX-060 CHO to five (Figure 43). MS/MS 
fragmentation of the peaks was not conducted to confirm the identity of each isoform; 
however, the higher abundance of the earlier peak suggested that Gal was attached to 
the a-l,6-Man arm of this /V-glycan (Shinkawa ei al (2003) J, Biol Chem. 278:3466- 
3473; Zhu et al (2005) Nat, Biotechnol 23: 1 159-1 169). On-line negative mode 
QTOF MS analysis gave m/z values corresponding to doubly charged GnGnF^, Man5, 
GnAisoF^ (both isoforms), and AAF^ confirming the MALDI-TOF MS results (Table 
3 below). Peak integration of the fluorescent trace revealed that GnGnF^ Man5, 
AGnF^ GnAF^ and AAF^ constituted 50.8, 2.5, 26.1, 10.7 and 6.8%, respectively, of 
the total A^-glycan structures from MDX-060 CHO. The remaining 3.1% of A^-glycans 
were found to be a mixture of GnGn, GnMisoF^ GnMjso, and MM with no structure 
higher than 1.2% of the total (data not shown). 
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Table 3. Summary of observed MALDl-TOF and QTOF MS masses of the major 2-AA 
labeled A/-glycans from MDXA-060 mAbs produced by CHO cells (CHO), wild-type Lemna 
(LEX) or glyco-optimized Lemna lines expressing the RNAi construct (LEX^^). 



iV-glycan name" 


Proposed 

OLl UVt-UI V 


Theoretical miz 


Observed 


Observed Q- 


% Peak 


CHO 




[M-H]- 


[M-2H]^- 


[M-H]' 


[M-2H]^- 




GnGnF^-2AA 




1 coo COA 


nc\c\ TQl 1 

/yu./yi 1 


1582.455 


790.7825 


50.8 


Man5-2AA 




1354 47Q 


676 7436 

\J f KJm t ~J\J 


1354.392 


676.7343 


2.50 


GnAisoF^-2AA 




1744.642 


871.8175 


1744,492 


871.7970 


36.8 


AAF*-2AA 




1906.695 


952.8438 


1906.567 


952.8181 


6.80 


LEX 














GnGn-2AA 




1436.532 


717.7622 


1436.549 


717.7894 


8,40 


GnGnX-2AA 




1568.574 


783.7833 


1568.581 


783.8150 


17.2 


GnGnXF^-2AA 




1714.632 


856.8122 


1714.615 


856.853 


67.4 


LEX?''' 














GnGn-2AA 




1436.532 


717.7622 


1436.523 


717.7993 


95.8 



A^-glycan names are based on Proglycan (http://www.proglycan.com) nomenclature. 
2AA, 2-aminobenzoic acid. 

^^e symbols of the proposed A^-glycan structures are as follows: ^, N-- 
acetylglucosamine; © , mannose; • , galactose; ^ , xylose; ▲ , a-l,3-fucose; 
a- 1 ,6-fucos^ , 2-aminobenzoic acid. 

*^he m/z values and the % peak area of each A^-glycan structure were obtained from 
Figure 2. 



5 MALDT-TOF MS analysis of wild-type MDX-060 LEX m Ab revealed the 

presence of three major species with m/z values corresponding to GnGnXF"', GnGnX 
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and GnGn (Figure 42). NP-HPLC-QTOF MS analysis showed three major 
fluorescent peaks with m/z values corresponding to doubly charged GnGnXF^ 
GnGnX and GnGn, again confirming the MALDI-TOF MS results (Figure 43; Table 
3). Integration of the fluorescent peaks indicated that GnGnXF^, GnGnX and GnGn 
5 constituted 67.4, 17.2 and 8.4%, respectively, of the total A^glycans derived from the 

MDX-060 LEX mAb. The remaining 7% of A/-glycans were determined to be a 
mixture of MM, GnMiso, MMXF^ GnGnF^ GnMisoXF\ Man6, Man7, Gn(FA)isoXF\ 
Man8 and Man9 with no A^-glycan greater than 2% of the total. Similar results were 
seen with mAbs isolated from two independently transformed MDX-060 LEX lines 

10 (data not shown). The simple array of A^-glycans on LEX mAbs demonstrated here 

provides an amenable starting point for glyco-optimization. 

In contrast to the MDX-060 LEX mAb, //-glycans from the MDX-060 LEX^^ 
mAb possessed GnGn as the major //-glycan species by both MALD-TOF MS and 
NP-HPLC-QTOF MS analysis (Figures 42 and 43; Table 3). GnGn comprised 95.8% 

15 of the total iV-glycans with the remaining 4.2% of A/-glycans determined to be MM, 

GnMiso, GnAiso, Man6, Man7 and Man8 with no one structure greater than 1.2% of 
the total AT-glycans. None of the LEX^^ A^-glycans contained fucose (Fuc) or xylose 
(Xyl). These results demonstrated that co-expression of an RNAi construct targeting 
Lemna FucT and XylT resulted in the complete elimination of Fuc and Xyl-containing 

20 A^-glycans from MDX-060 LEX^* mAbs and produced highly homogeneous mAb 

glycoforms. The same results were obtained for MDX-060 LEX^'mAb harvested 
from an independent transgenic line (line 225) comprising the MDXA04 expression* 
vector, at a different growth scale (300 g tissue for transgenic line 225 versus 1 g 
tissue for transgenic line 52, which produced the MDX-060 LEX^^mAb having the 

25 A/^glycan profile shown in Figures 42 and 43. Unlike mammalian cell culture systems 

where A^-glycan heterogeneity can change with culture conditions, growth scale, and 
growth period (Kanda et ai (2006) Biotechnol. Bioeng. 94:680-688), the glycan 
uniformity observed with LEX^^ mAbs was shown to be consistent under a variety of 
growth conditions and scales. 

30 The absence of Fuc or Xyl on MDX-060 LEX°p' mAb A^-glycans was fiirther 

confirmed by monosaccharide analysis (Table 4 below). Monosaccharides were 
released from MDX-060 CHO, LEX and LEX^^ mAbs by acid hydrolysis and 
analyzed by high performance anion exchange chromatography (HPAEC) coupled to 
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10 



15 



pulsed amperometric detection (PAD). The monosaccharide ratios for Man and 
GlcNAc residues were similar for CHO and wild-type LEX mAbs and correlated well 
with expected values. LEX mAbs were significantly decreased in Gal and Fuc 
content and had a significant increase in Xyl when compared to CHO-derived mAbs. 
Monosaccharide analysis of Lemna derived mAbs revealed that while Fuc and Xyl 
were present on wild-type LEX A^-glycans, they were not detected on LEX^^ mAbs. 
Collectively, these results demonstrate that co-expression of an RNAi construct 
targeting Lemna XylT and FucT results in the eleimination fo Fuc and Xyl-containing 
A^-glycans from MDX-060 LEX^^ mAbs and produce highly homogeneous mAb 
glycoforms. The robustness of this glyco-optimization strategy has been confirmed 
with multiple independent plant lines expressing the MDX-060 LEX^^ mAb as well 
as with other mAbs expressed in the LEX System, for example, the mAbl monoclonal 
antibody discussed in the examples herein above. In these subsequent 
transformations, glyco-optimized mAb expression levels up to 6% of total soluble 
protein (TSP) have been obtained. 



20 



Table 4. Monosaccharides released from MDX-060 CHO, LEX and LEX^^ mAbs by 
acid hydrolysis and analyzed by HPAEC-PAD. The monosaccharide content from 
each mAb was determined by normalizing against carbohydrate controls. 



Monosaccharide 


MDX-060 CHO 
pmol (% total) 


MDX-060 LEX 
pmol (% total) 


MDX-060 
LEX°P' 
pmol (% total) 


Fuc 


254 (20) 


232(13) 


0 


GicNAc 


605 (47) 


773 (45) 


1.003 (67) 


Gal 


75(6) 


0 


0 


Man 


355 (27) 


491 (29) 


501 (33) 


Xyl 


0 


226 (13) 


0 


Total 


1,289 (100) 


1,722(100) 


1,504(100) 
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Functional activity ofMDX-060 CHO, LEX and LEX^^' mAbs . 

Antigen binding properties of the MDX-060 CHO, MDX-060 LEX, and 
MDX-060 LEX^^ mAbs were determined using CD30 expressing L540 cells. All 
three mAbs had nearly identical binding curves (Figure 43). EC50 concentrations 
5 were determined to be 0. 1 80 Mg/mL, 0.227 ng/mL, and 0. 1 96 Mg/mL for MDX-060 

CHO, LEX, and LEX^^ respectively (Figure 44), indicating that antigen binding for 
all three mAbs were similar. 

Fc-receptor-mediated effector cell function has been shown to be important for 
the in vivo activity of many therapeutic mAbs. Since the FcR expressed on NK cells 

10 and macrophages responsible for ADCC activity is FcyRIIla, binding of the various 

mAbs to this receptor was compared. FcR binding of CHO, LEX and LEX^* mAbs 
was determined by equilibrium binding of the mAbs with effector cells expressing 
two different human FcRyHIa allotypes (Phe^^^ or Val^^^). MDX-060 LEX had a 1.7- 
fold increase in FcRylllaPhe^^^ and a 0.4-fold decrease in FcRylHaVal*^^ binding 

15 compared to the CHO-derived mAb, demonstrating that receptor binding for CHO 

and LEX mAbs were similar. In contrast, LEX^^ mAbs had a 27 and 15-fold higher 
affinity for FcRylHaPhe'^^ and FcRyI^aVal^^^ respectively, than CHO mAbs (Figure 
45). These results suggested that RNAi silencing of the Lemna FucT and XylT 
activities in LEX^^ lines produced mAbs with enhanced FcR binding. 

20 ADCC activities of the CHO, LEX and LEX^' mAbs were determined by 

I CO 

incubating mAbs with either homozygous (FcRylllaPhe ) or heterozygous 
(FcRylHaPheA^al^^^) human effector cells and BATDA 

(bis(acetoxymethyl)2,2':6',2"-terpyridine-6,6"-dicarboxylate) labeled L540 target 
cells (Figure 45). MDX-060 LEX mAbs (31%) had the same maximal percent cell 

25 lysis as CHO mAbs (31%) using heterozygous FcRylllaPhe/Val*^^ human effector 

cells (Figure 46) with similar EC50 values. Maximal percent cell lysis for LEX mAbs 
(27%) was slightly increased compared to CHO mAbs (15%) using homozygous 
FcyRHIa Phe/Phe^^^ effector cells. Importantly, LEX^* mAbs had significantly 
increased ADCC activity compared to MDX-060 CHO and LEX mAbs, irrespective 

30 of the donor genotype. This was assessed by both an increase in potency and 

efficacy. Maximal percent lysis for MDX-060 Lex^' was 45% for both experiments, 
while the EC50 value was 3 to 5 times lower than MDX-060 LEX and MDX-060 
CHO mAbs, respectively, for FcyRHIa Val/Phe^^^ effector cells and 2 to 3 times lower 
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for the FcyRITIa Phe/Phe^^^ effector cells. These results demonstrate that removal of 
Fuc and Xyl-containing A^-glycans from MDX-060 LEX^^ mAbs caused an 
enhancement in ADCC activity and hence can improve their therapeutic potential, 

5 RP'HPLC-Q'TOF MS analysis of intact IgG for MDX-060 LEX and MDX-060 

LEX^\ 

Protein A purified IgG's (50 ^g) were desalted using the Waters 2695 HPLC 
system fitted with a Poros Rl-10 column (2 mm x 30 mm; Applied Biosystems). 

10 IgG's were detected and analyzed using a Waters 2487 dual wavelength UV detector 

(280 nm) and the Waters Q-TOF API US. Separations were conducted at 0. 15 
mL/min, 60°C, using 0.05% (v/v) trifluoroacetic acid (TFA; solvent A) and 0.05% 
(v/v)TFA, 80% (v/v) acetonitrile (solvent B). Sample elution was carried out using a 
linear increase from 30 to 50% B for 5 min, an increase to 80% B for 5 min. The 

15 solvent ratio remained at 80% B for an additional 4 min, followed by a wash with 

100% B for 1 min and equilibration of the column with 30% B for 15 min prior to the 
next run. 

Q-TOF analysis was conducted in positive ion mode with source and 
desolvation temperatures of lOO^C and 300°C, respectively, and capillary and cone 
20 voltages of 3.0 and 60 V, respectively. Data are the result of combining >100 

individual scans and deconvolution to the parent mass spectrum using MaxEnt 1. 

See also Triguero et ai (2005) Plant Biotechnol ./. 3: 449-457; Takahashi et 
ai {199^) AnaL Biochem, 255: 183-187; DWXon etai (2004)7. Chromatogr, A, 1053: 
299-305. 

25 Figure 47 shows intact mass analysis of the MDX-060 LEX mAb 

compositions produced in wild-type L minor comprising the MDXAOl construct. 
When XylT and FucT expression are not suppressed in L. minor, the recombinantly 
produced MDX-060 LEX mAb composition comprises at least 7 different 
glycoforms, with the GOXF^ glycoform being the predominate species present. Note 

30 the absence of a peak representing the GO glycoform. 

Figure 48 shows glycan mass analysis of the heavy chain of the MDX-060 
LEX mAb produced in wild-type L, minor comprising the MDXAOl construct. When 
XylT and FucT expression are not suppressed in L minor, the predominate A^-glycan 
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species present is GOXF^, with additional major peaks reflecting the GOX species. 
Note the minor presence of the GO glycan species. 

Figure 49 shows intact mass analysis of the MDX-060 LEX^ mAb 
compositions produced in transgenic L, minor comprising the MDXA04 construct. 
5 When XylT and FucT expression are suppressed in L minor, the intact mAb 

composition contains only GO A^-glycans. In addition, the composition is substantially 
homogeneous for the GO glycoform (peak 2), wherein both glycosylation sites are 
occupied by the GO A^-glycan species, with two minor peaks reflecting trace amounts 
of precursor glycoforms (peak 1, showing mAb having an Fc region wherein the Ch2 

1 0 domain of one heavy chain has a GO glycan species attached to Asn 297, and the Ch2 

domain of the other heavy chain is unglycosylated; and peak 3, showing mAb having 
an Fc region wherein the Asn 297 glycosylation site on each of the Ch2 domains has a 
GO glycan species attached, with a third GO glycan species attached to an additional 
glycosylation site within the mAb structure). 

1 5 Figure 50 shows glycan mass analysis of the heavy chain of the MDX-060 

L£^opt j^jj produced in transgenic L, minor comprising the MDXA04 construct. 
When XylT and FucT expression are suppressed in L. minor, the only readily 
detectable iV-glycan species attached to the Asn 297 glycosylation sites of the Ch2 
domains of the heavy chains is GO. 

20 

DISCUSSION 

Glyco-optimization of MDX-060 was accomplished by co-expression with an 
RNAi cassette aimed at silencing the endogenous Lemna FucT and XylT genes. This 
simultaneous silencing of both FucT and XylT genes was achieved using a single 

25 RNAi transcript. The absence of Fuc and Xyl on the LEX^* mAb was confirmed by 

MALDI-TOF, NP-HPLC-QTOF MS, and monosaccharide analysis of iV-glycans 
purified from the MDX-060 LEX^' mAb. These analyses corroborate the lack of 
transferase activity observed in microsomal membranes. Importantly, >95% of the N- 
glycans released from LEX^^ mAbs were of a single structure, GnGn, indicating that 

30 this strategy had the added benefit of producing mAbs with a homogeneous A^-glycan 

profile. MDX-060 LEX and LEX^ mAbs were found to be indistinguishable with 
regard to thermal stability and antigen binding compared to MDX-060 CHO. 
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Electrophoretic analysis was also found to be similar for all three mAbs. In fact, the 
only structural differences detected were in the mAb A^glycan profiles. 

Without being bound by theory, the ability of the MDX-060 LEX^ lines to 
produce mAbs with a single major A^-glycan species may be based on the more 
5 uniform mAb glycoform distribution found in wild-type Lemna. A^-glycans released 

from mAbs purified from wild-type tobacco (Fujiyama ei al (2006) J. Biosci. Bioeng. 
101:212-218; Elbers et al (2001) Plant Physiol 126: 13 14-1322; Bakker et al (2001) 
Proc. Natl Acad, ScL 98:2899-2904), alfalfa (Bardor et al (2003) Plant Biotechnol 
J, 1 :45 1-462), and moss (Koprivova et al (2003) Plant Bio, 5:582-591) show that 

10 mAb glycoform heterogeneity in plants with wild-type A^-glycosylation can range 

from five (alfalfa) (Bardor et al (2003) Plant Biotechnol J, 1 :45 1 -462)to eight 
(tobacco) (Fujiyama et al (2006) J. Biosci Bioeng, 101:212-218) different major 
structures. MDX-060 LEX possesses only three major //-glycan structures (GnGn, 
GnGnX and GnGnXF^). This simple array of //-glycans on mAbs produced by wild- 

15 type Lemna may provide a more amenable starting point for glyco-optimization 

leading to greater homogeneity than that observed in other systems. 

Fc-receptor mediated effector cell function has been shown to be important for 
the in vivo activity of many therapeutic mAbs. In this study, the ADCC activity of 
MDX-060 CHO, MDX-060 LEX, and MDX-060 LEX^' mAbs was compared. Since 

20 the FcR expressed on NK cells and macrophages responsible for ADCC activity is 

FcyRJIIa, the binding of the various mAbs to this receptor was also compared. The 
results discussed above show that MDX-060 LEX^* mAb has an increased binding 
affinity (15-25 fold) and maximal binding (4-5 fold) to FcyRIIIa as well as enhanced 
ADCC activity compared to MDX-060 CHO and MDX-060 LEX mAbs. The 

25 removal of a- 1,6- linked Fuc from various mAbs produced in other expression systems 

has been shown previously to increase FcR binding and enhance ADCC function 
(Shinkawa et al (2003) J, Biol Chem, 278:3466-3473; Shields et al (2002) J. Biol 
CAe/n. .277:26733-26740; Niwae/ a/. (2004) C//«. Cancer Res, 10:6248-6255). The 
results presented herein suggest that removal of the a-l,3-linked Fuc from the MDX- 

30 060 LEX^* mAbs has the same effect on mAb function as the removal of a-L6- 

linked Fuc. 

In this study, two naturally occurring polymorphic isoforms of FcyRJIIa at 
residue 158'*\ Val*^^ and Phe^^^ were evaluated. MDX-060 LEX^' shows a higher 
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binding affinity to FcyRJTTa-Val^^^ compared to FcyRIDa-Phe^^^as has been observed 
with other IgGl mAbs (Shields et al (2002)7. Biol Chem. 277:26733-26740). The 
fact that an increase in binding with MDX-060 LEX*^ was observed with both 
isoforms is important since differential binding to Val^^^ over Phe*^* was found to be 
S predictive of the clinical and immunological responsiveness of certain patient groups 

receiving anti-CD20 treatment (Cartron et al (2002) Blood 99:754-758; Weng et al 
(2003)7. Clin, Oncol 21:3940-3947; Weng era/. (2004)7. Clin, Oncol 22:4717- 
4724). This increase in binding has been hypothesized to result in a higher percentage 
of patients responding to treatment that requires Fc functionality. 

10 A similar increase in ADCC activity was also observed. In this study, the 

MDX-060 LEX^* mAb showed an increase in cell lysis and a decrease in the EC50 
value, resulting in an increase in efficacy and potency when compared to MDX-060 
CHO. This corresponds to a 20-fold increase in activity, determined by taking the 
maximum percent lysis of MDX-060 CHO and calculating the concentration of 

15 MDX-060 LEX^* mAb giving rise to the same percent cell lysis. As with the 

FcyRHIa binding study, the increase in ADCC activity was observed with both a 
homozygous FcyRIIIaPhe/Phe*^^ and a heterozygous FcyRIIIa PheA^al^^* effector cell 
donor. The results presented here suggest that removal of the a -1 ,3-linked Fuc from 
the MDX-060 LEX^^ mAbs has the same effect on niAb function as the removal of 

20 a-l,6-linked Fuc. 

The robustness of this glyco-optimization strategy has been demonstrated with 
multiple independent Lemna plant lines expressing the MDX-060 LEX^^ mAb as 
well as with other mAbs expressed in the Lemna expression system (see, for example, 
Examples 2-4 above). Furthermore, there is no apparent difference in plant phenotype 

25 or growth rate compared with wild-type Lemna plants. Unlike mammalian cell 

culture systems where A^-glycan heterogeneity can change with culture conditions, 
growth scale and growth period*, the glycan uniformity observed with LEX^* mAbs 
has been shown to be consistent under a variety of growth conditions and scales (data 
not shown). 

30 In conclusion, an RNAi strategy was used to produce a glyco-optimized anti- 

CD30 antibody in the Lemna expression system. The resulting mAb consists of a 
single, major iV-glycan structure, without any evidence of the plant-specific Fuc and 
Xyl residues. In addition, the resulting optimized mAb has increased ADCC activity 
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and FcyRIIIa binding activity compared to a CHO-derived mAb. The homogeneous 
glycosylation profile obtained on mAbs produced in a Lemna expression system 
having this FucT+XylT gene knockout strategy makes it is possible to express these 
mAbs with increased production consistency. 

5 

Example 7: Scale-up Production of Glvcan-Qptimized mAbI in Lemna minor 

L minor transgenic line 24 comprising the niAbI04 construct of Figure 12 
(providing for suppression of FucT and XylT) was generated in a manner similar to 
that described above. Following its generation, transgenic line 24 was continuously 

10 maintained by clonal culture, wherein periodically a subsample of the plant culture 

was transferred to fresh culture medium for further culturing. This transgenic line 
was analyzed for the A^-glycosylation pattern of the recombinantly produced mAbI 
antibody following production scale-up from 1 g tissue up to 300 g (0.3 kg) tissue, 
and further production scale-up to 6.5 kg tissue. The process of scaling production up 

15 to 6.5 kg tissue occurred approximately 8 months after transgenic line 24 was 

generated. Results are shown in Figures 51A (MALDI-TOF analysis of iV-glycans) 
and 5 1 B (HPLC fluorescence analysis of A^-glycans). 

The glycosylation profile for the mAbI antibody produced by transgenic line 
24 comprising the mAbI04 construct remained homogeneous with scale-up in 

20 production from 1 g tissue to 0.3 kg tissue, and further scale-up in production to 6.5 

kg tissue, and thus was characterized by the presence of a single predominant peak 
corresponding to the GnGn (i.e., GO) glycan species. Thus, the homogeneity of the 
glycosylation profile in transgenic L minor comprising the mAbI04 construct was 
consistently maintained with an approximately 6,500-fold increase in production scale 

25 (i.e., from 1 g up to 6.5 kg). Furthermore, the homogeneity of the glycosylation 

profile was consistently maintained in this transgenic line at 8 months following its 
generation. 

These data demonstrate that the homogeneity of the glycosylation profile in 

I 

transgenic L minor comprising the mAbI04 construct remains consistent with at least | 
30 a 6,500-fold increase in production scale. Furthermore, the homogeneity of the I 

glycosylation profile in transgenic L minor comprising the mAbI04 construct is 
maintained for at least 8 months after the transgenic line is generated. The 

homogeneity of the glycosylation profile would be expected to be maintained with I 
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further increase in production scale, and thus, for example, would be expected to be 
maintained if production scale was increased by another 4-fold beyond 6.5 kg (e.g., 
scale-up from 6.5 kg to 26 kg). The homogeneity of the glycosylation profile would 
also be expected to be maintained with continuous clonal culture of the transgenic line 
5 well beyond 8 months after generation of the transgenic line. 

Example 8: Glvcosvlation Pattern for Endogenous Glycoproteins in Lemna minor 
Lines Transgenic for mAbI04 RNAi Construct 

The L40 protease is a representative endogenous glycoprotein produced in I. 

10 minor. In order to assess the impact of the mAbI04 RNAi construct (Figure 12) on 

glycosylation of endogenous proteins, the L40 protein was isolated from a L minor 
line transgenic for the mAbI04 RNAi construct using benzamidine affinity 
chromatography. The AZ-glycosylation pattern for the isolated L40 protein was 
analyzed using MALDl-TOF analysis in the manner described above. Results are 

1 5 shown in Figure 52, 

As can be seen from this analysis, suppression of FucT and XylT expression 
using the chimeric RNAi mAbI04 construct results in endogenous glycoproteins 
having a homogeneous glycosylation pattern consistent with that observed for 
recombinant glycoproteins. Thus, the heterogeneous A^-glycan profile for the L40 

20 glycoprotein isolated from L minor having the wild-type glycosylation machinery is 

represented by a mixture of iV-glycans species having the pi,2-linked xylose residue, 
or both the pi,2-linked xylose and core al,3-linked fucose residues attached. In 
contrast, the homogeneous A^-glycan profile for L40 isolated from L minor transgenic 
for the mAbI04 RNAi construct is represented by a single predominant peak 

25 corresponding to the GO glycan species, and is characterized by the absence of A^- 

glycan species having the pi,2-linked xylose or both the pi,2-linked xylose and core 
al,3*linked fucose residues attached. 

Example 9: Production of Anti-CD20 and Anti-HER2 Monoclonal Antibodv Having 
30 Increased ADCC Activitv 

1DEC-C2B8 (IDEC Pharmaceuticals Corp., San Diego, California; 
commercially available under the tradename Rituxan®, also referred to as rituximab; 
see U.S. Patent No. 5,736,137, herein incorporated by reference) is a chimeric anti- 
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CD20 monoclonal antibody containing human IgGl and kappa constant regions with 
murine variable regions isolated from a murine anti-CD20 monoclonal antibody, 
IDEC-2B8 (Rcffet al (1994) Blood 83:435-445). Rituximab is licensed for treatment 
of relapsed B cell low-grade or follicular non-Hodgkin's lymphoma (NHL). The anti- 
5 CD20 antibody marketed as rituximab (Rituxan®) is recombinantly produced in CHO 

cells. The glycosylation pattern of this CHO-expressed anti-CD20 antibody reveals a 
heterogeneous mixture of glycoforms. 

A humanized anti-ERBB2 antibody is commercially available under the 
tradename Herceptin® (Genentech, Inc., San Francisco, California) (see U.S. Patent 

10 No, 6,165,464, herein incorporated by reference). This recombinant humanized 

monoclonal antibody has high affinity for pl85H£R2. Early clinical trials with 
patients having extensive metastatic breast carcinomas demonstrate the ability of this 
monoclonal antibody to inhibit growth of breast cancer cells that overexpress HER2 
(Baselga^M/, (1996)7. Clin, Oncol 14(3):737.744). 

1 5 A rituximab-sequence antibody and Herceptin® anti-ERBB2-sequence 

antibody are recombinantly produced in Lemna having a wild-type glycosylation 
pattern, using the mAblOl construct described above, and in Lemna that is genetically 
modified to suppress expression of both FucT and XylT, using the mAbl04 chimeric 
RNAi construct, with the rituximab or anti-ERBB2 heavy and light chain coding 

20 sequences replacing those for the mAbI in each of these constructs. The sequences 

encoding the heavy and light chains are optionally both codon-optimized with Lemna- 
preferred codons. The secreted rituximab-sequence mAb (i.e., having the amino acid 
sequence of the rituximab antibody) and anti-ERBB2-sequence mAb (i.e., having the 
amino acid sequence of the Herceptin® anti-ERBB2 mAb) are analyzed for 

25 glycosylation pattern as described above in Example 1 . 

The glycosylation profile for intact rituximab-sequence mAb or anti-ERBB2- 
sequence mAb produced in the wild-type Lemna comprising the mAblOl-like 
construct shows a heterogeneous profile with numerous peaks corresponding to 
multiple glycoforms. In contrast, the glycoyslation profile for intact rituximab- 

30 sequence mAb or intact anti-ERBB2-sequence mAb produced in transgenic Lemna 

comprising the mAbI04-like construct shows a substantially homogeneous 
glycoprotein composition, with three major glycoform peaks, the largest of which 
corresponds to the GO glycoform, and two very minor peaks corresponding to trace 
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amounts of precursor glycoforms, wherein xylose and fucose residues are not 
attached. 

The rituximab-sequence and anti-ERBB2-sequence monoclonal antibody 
compositions having a glycosylation profile that is substantially homogeneous for the 
5 GO glycoform are tested for ADCC activity and Fey receptor nia (Fey Rllla) binding 

on freshly-isolated human NK cells. The rituximab-sequence and anti-ERBB2- 
sequence mAbs produced from L minor lines engineered with the mAbI04-like 
construct exhibit the expected enhanced binding in view of the lack of any fucose 
residues, relative to the binding observed for the rituximab-sequence and anti- 

10 ERBB2-sequence mAbs produced from L minor lines having the wild-type 

glycosylation machinery (i.e., no silencing of FucT or XylT). Furthermore, binding 
affinity is at least strong as for the corresponding mAb produced in CHO cell lines. 

ADCC activity of the substantially homogeneous GO glycoform of rituximab- 
sequence mAb and of anti-ERBB2 -sequence mAb produced in the L minor line 

15 engineered with the mAbI04-like construct is assayed using purified human peripheral 

blood mononuclear cells as effector cells (see, for example, Shinkawa et al (2003) J. 
Biol. Chem. 278(5):3466-3473; herein incorporated by reference in its entirety). 
Activity of the GO glycoform rituximab-sequence and anti-ERBB2-sequence mAb 
compositions is improved 50-1000 fold over that exhibited by the respective 

20 rituximab-sequence mAb or anti-ERBB2-sequence mAb produced in the L minor line 

having the wild-type glycosylation machinery or produced in CHO cells. 

CDC activity of the substantially homogeneous GO glycoform of rituximab- 
sequence mAb produced in the L minor line engineered with the niAbI04-like RNAi 
construct is assayed using standard assays known in the art (see, for example, the 

25 complement activation assays described in U.S. Patent Application Publication No. 

2004/01673 19, herein incorporated by reference in its entirety), and compared to that 
observed for rituximab. Non-relevant antibody serves as the negative control. In one 
such assay, CDC activity of the various antibodies against target Daudi cells is 
measured by assessing elevated membrane permeability using a propidium iodide (PI) 

30 exclusion assay, with serum from healthy volunteers serving as a complement source. 

Serum for complement lysis is prepared by drawing blood from healthy voliuiteers 
into autosep gel and clot activator vacutainer tubes (BD Biosciences, Rutherford, New 
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Jersey), which are held at room temperature for 30-60 minutes and then centrifuged at 
3000 rpm for 5 minutes. Serum is harvested and stored at -80°C. 

Briefly, for this CDC activity assay, Daudi cells are washed and resuspended 
in RPMI-1% BSA at I X 10^/ml. Various concentrations of the substantially 

5 homogeneous GO glycoform of rituximab-sequence mAb, rituximab, and negative 

control mAb are added to the Daudi cells and allowed to bind for 10-15 minutes at 
room temperature. Thereafter, serum as a source of complement is added to a final 
concentration of 20% (v/v) and the mixtures are incubated for 45 min at 37°C. The 
cells are then kept at 4°C until analysis. Each sample (150 ^il) is then added to 10 nl 

10 of PI solution (10 ng/ml in PBS) in a FACS tube. The mixture is assessed 

immediately for cell lysis (number of Pl-positive cells) by a FACScalibur flow 
cytometer and analysed using CellQuest pro software (BD Biosciences, Mountain 
View, California), At least 5000 events are collected for analysis with cell debris 
excluded by adjustment of the forward sideward scatter (PCS) threshold. 

1 5 CDC activity of the substantially homogeneous GO glycoform rituximab- 

sequence mAb is decreased relative to that exhibited by rituximab. 



20 All publications and patent applications mentioned in the specification are 

indicative of the level of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent application was specifically and 
individually indicated to be incorporated by reference. 
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10 



THAT WHICH IS CLAIMED: 

1 . A transgenic higher plant that has been stably transformed to express 
glycoproteins with an altered A^-glycosylation pattern, wherein said altered N- 
glycosylation pattern is characterized by a reduction in the attachment of al,3-fucose 
and pi,2-xylose residues to A^-glycans of said glycoproteins. 

2. The transgenic higher plant of claim 1 , wherein said //-glycans are 
devoid of said fucose and xylose residues. 

3. The transgenic higher plant of claim 1 or claim 2, wherein said plant 
has the ability to produce a mammalian glycoprotein having said altered N- 
glycosylation pattern. 



1 5 4. The transgenic higher plant of claim 3, wherein said glycoprotein is a 

monoclonal antibody of interest. 

5. The transgenic higher plant of claim 4, wherein said monoclonal 
antibody has increased binding affmity for an FcyRIII, increased antibody-dependent 
20 cellular cytotoxicity (ADCC), decreased complement-dependent cytotoxicity (CDC) 

activity, or any combination thereof, as a result of said altered A^-glycosylation pattern 
of said monoclonal antibody. 



6. A duckweed plant or duckweed plant cell or nodule that expresses 
25 glycoproteins having an altered A^-glycosylation pattern, wherein said altered A^- 

glycosylation pattern is characterized by a reduction in the attachment of al,3-fucose 
residues, pi,2-xylose residues, or both al,3-fucose residues and pi,2-xylose residues 
to A^-glycans of said glycoproteins. 

30 7. The duckweed plant or duckweed plant cell or nodule of claim 6, 

wherein said A^-gl yeans are devoid of said fucose residues, devoid of said xylose 
residues, or devoid of said fucose and xylose residues. 
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8. The duckweed plant or duckweed plant cell or nodule of claim 6 or 
claim 7, wherein said duckweed plant or duckweed plant cell or nodule has the ability 
to produce a mammalian glycoprotein of interest. 

5 9. The duckweed plant or duckweed plant cell or nodule of claim 8, 

wherein said glycoprotein is a monoclonal antibody of interest. 



10. The duckweed plant or duckweed plant cell or nodule of claim 9, 
wherein said monoclonal antibody has increased binding affinity for an FcyRin, 
10 increased antibody-dependent cellular cytotoxicity (ADCC), decreased complement- 

dependent cytotoxicity (CDC) activity, or any combination thereof, as a result of said 
altered iV-glycosylation pattern of said monoclonal antibody. 



11. A composition comprising a glycoprotein produced by the duckweed 
1 5 plant or duckweed plant cell or nodule of any one of claims 6 through 10. 



12. The composition of claim 1 1, wherein said //-glycans are devoid of 
said fucose and xylose residues. 

20 13. The composition of claim 1 2, wherein said glycoprotein has an A^- 

glycosylation profile characterized by the presence of a single predominant peak 
corresponding to the GO glycan species. 



14. The composition of claim 13, wherein said glycoprotein is a 
25 monoclonal antibody of interest. 



1 S. A method for altering the 7V-glycosylation pattern of a heterologous 
polypeptide produced in a duckweed plant or duckweed plant cell or nodule, said 
method comprising: 

30 (a) inhibiting expression or function of at least one al,3- 

fucosyltransferase (FucT) in said a duckweed plant or duckweed plant cell or nodule, 
wherein said FucT comprises an amino acid sequence having at least 90% sequence 
identity to the sequence set forth in SEQ ID N0:3; and 
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(b) culturing said duckweed plant or duckweed plant cell or nodule 
under conditions suitable for expression of said heterologous polypeptide. 

1 6. The method of claim 1 5, wherein said FucT comprises an amino acid 
5 sequence having at least 95% sequence identity to the sequence set forth in SEQ ID 

N0:3. 



10 



1 7. The method of claim 16, wherein said FucT comprises the amino acid 
sequence set forth in SEQ ID N0:3. 



1 8. The method of claim 1 S, wherein the expression or function of said 
FucT is inhibited by a method selected from the group consisting of: 

(a) introducing a polynucleotide or polypeptide into said duckweed 
plant or duckweed plant cell or nodule, wherein said polynucleotide or polypeptide 

IS inhibits expression or function of said FucT in said a duckweed plant or duckweed 

plant cell or nodule; 

(b) eliminating a gene in said duckweed plant or duckweed plant 
cell or nodule, wherein said gene encodes said FucT; and 

(c) mutating a gene in said duckweed plant or duckweed plant cell 
20 or nodule, wherein said gene encodes said FucT. 

19. The method of claim 18, comprising introducing into said duckweed 
plant or duckweed plant cell or nodule a nucleotide construct comprising a first 
nucleotide sequence that is capable of inhibiting expression or function of an al,3- 

25 fixcosyltransferase (FucT) in said a duckweed plant or duckweed plant cell or nodule, 

wherein said first nucleotide sequence is operably linked to a promoter that is 
functional in a plant cell. 



20. The method of claim 1 9, wherein said first nucleotide sequence 
30 comprises a sequence selected from the group consisting of: 

(a) the nucleotide sequence set forth in SEQ ID NO: 1 or a 
complement thereof; 
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(b) the nucleotide sequence set forth in SEQ ID N0:2 or a 
complement thereof; 

(c) a nucleotide sequence having at least 90% sequence identity to 
the sequence of preceding item (a) or (b); and 

5 (d) a fragment of the nucleotide sequence of any one of preceding 

items (a) through (c), wherein said fragment comprises at least 75 contiguous 
nucleotides of said nucleotide sequence. 

21. The method of claim 19, wherein said first nucleotide sequence 
10 comprises in the 5*-to-3' orientation and operably linked: 

(a) a FucT forward fragment, said FucT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 
sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 
nucleotides of SEQ ID NO: 1 or SEQ ID N0:2; 
15 (b) a spacer sequence comprising about 200 to about 700 

nucleotides; 

(c) and a FucT reverse fragment, said FucT reverse fragment 
having sufficient length and sufficient complementarity to said FucT forward 
fragment such that said first nucleotide sequence is transcribed as an RNA molecule 
20 capable of forming a hairpin RNA structure. 

22. The method of claim 21, wherein said FucT reverse fragment 
comprises the complement of said FucT forward fi*agment or a sequence having at 
least 90% sequence identity to the complement of said FucT forward fragment. 

25 

23. The method of claim 2 1 or 22, wherein said FucT forward fragment 
comprises nucleotides (nt) 255-985 of SEQ ID NO:l and said spacer sequence 
comprises an intron. 

30 24. The method of claim 1 5, further comprising inhibiting expression or 

function of at least one pl,2-xylosyltransferase (XylT) in said duckweed plant or 
duckweed plant cell or nodule, wherein said XyltT comprises an amino acid sequence 
having at least 90% sequence identity to the sequence set forth in SEQ ID N0:6 or 
SEQ ID N0:21. 
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25. The method of claim 24, wherein said XylT comprises an amino acid 
sequence having at least 95% sequence identity to the sequence set forth in SEQ ID 
N0:6 or SEQIDN0:21. 

26. The method of claim 25, wherein said XylT comprises the amino acid 
sequence set forth in SEQ ID N0:6 or SEQ ID N0:2 1. 

27. The method of claim 24, wherein the expression or function of said 
XylT is inhibited by a method selected from the group consisting of: 

(a) introducing a polynucleotide or polypeptide into said duckweed 
plant or duckweed plant cell or nodule, wherein said polynucleotide or polypeptide 
inhibits expression or function of said XylT in said a duckweed plant or duckweed 
plant cell or nodule; 

(b) eliminating a gene in said duckweed plant or duckweed plant 
cell or nodule, wherein said gene encodes said XylT; and 

(c) mutating a gene in said duckweed plant or duckweed plant cell 
or nodule, wherein said gene encodes said XylT. 

28. The method of claim 27, comprising introducing into said duckweed 
plant or duckweed plant cell or nodule a nucleotide construct comprising a nucleotide 
sequence that is capable of inhibiting expression or function of said XylT in said 
duckweed plant or duckweed plant cell or nodule, wherein said nucleotide sequence 
that is capable of inhibiting expression or function of said XylT is operably linked to a 
promoter that is functional in a plant cell. 

29. The method of claim 28, wherein said nucleotide sequence that is 
capable of inhibiting expression or function of said XylT comprises a sequence 
selected from the group consisting of: 

30 (a) the nucleotide sequence set forth in SEQ ID N0:4, SEQ ID 

NO: 19, or a complement thereof; 

(b) the nucleotide sequence set forth in SEQ ID N0:5, SEQ ID 
NO:20, or a complement thereof; 
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(c) a nucleotide sequence having at least 90% sequence identity to 
the sequence of preceding item (a) or (b); and 

(d) a fragment of the nucleotide sequence of any one of preceding 
items (a) through (c), wherein said fragment comprises at least 75 contiguous 

S nucleotides of said nucleotide sequence. 

30. The method of claim 28, wherein said nucleotide sequence that is 
capable of inhibiting expression or function of said XylT comprises in the 5 -to-3' 
orientation and operably linked: 

(a) a XylT forward fragment, said XylT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 
sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 
nucleotides of SEQ ID N0;4, SEQ ID N0:5, SEQ ID NO: 1 9, or SEQ ID NO:20; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 

(c) and a XylT reverse fragment, said XylT reverse fragment 
having sufficient length and sufficient complementarity to said XylT forward 
fragment such that said nucleotide sequence that is capable of inhibiting expression or 
function of said XylT is transcribed as an RNA molecule capable of forming a hairpin 
RNA structure, 

31. The method of claim 30, wherein said XylT reverse fragment 
comprises the complement of said XylT forward fragment or a sequence having at 
least 90% sequence identity to the complement of said XylT forward fragment. 

25 

32. The method of claim 30 or 3 1 , wherein: 

(a) said XylT forward fragment comprises nucleotides (nt) 3 1 8- 
1052 of SEQ ID N0:4 and wherein said spacer sequence is an intron; or 

(b) said XylT forward fragment comprises nt 1-734 of SEQ ID 
30 NO: 1 9 and said spacer sequence is an intron. 

33. The method of claim 24, wherein a nucleotide construct comprising a 
fusion polynucleotide that is capable inhibiting expression or function of said FucT 
and said XylT in said duckweed plant or duckweed plant cell or nodule is introduced 
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into said duckweed plant or duckweed plant cell or nodule, wherein said fusion 
polynucleotide is operably linked to a promoter that is functional in a plant cell. 

34. The method of claim 33, wherein said fusion polynucleotide comprises 
5 in the 5*-to-3' orientation and operably linked: 

(a) a chimeric forward fragment, said chimeric forward fragment 
comprising in either order: 

(i) a first fragment comprising about 500 to about 650 
contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 

10 of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 

FucT; and 

(ii) a second fragment comprising about 500 to about 650 
contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 
of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 

15 XylT; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 

(c) and a reverse fragment, said reverse fragment having sufficient 
length and sufficient complementarity to said chimeric forward fragment such that 

20 said fusion polynucleotide is transcribed as an RNA molecule capable of forming a 

hairpin RNA structure. 

35. The method of claim 34, wherein said first fragment comprises about 
500 to about 650 contiguous nucleotides having at least 90% sequence identity to a 

25 nucleotide sequence of about 500 to about 650 contiguous nucleotides of SEQ TD 

NO: 1; and said second fragment comprises about 500 to about 650 contiguous 
nucleotides having at least 90% sequence identity to a nucleotide sequence of about 
500 to about 650 contiguous nucleotides of SEQ ID N0:4 or SEQ ID NO: 19. 

30 36. The method of claim 35, wherein said reverse fragment comprises the 

complement of said chimeric forward fragment or a sequence having at least 90% 
sequence identity to the complement of said chimeric forward fragment. 
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37. The method of claim 34 or 35, wherein: 

(a) said chimeric forward fragment comprises nucleotides (nt) 254- 
855 of SEQ ID NO: 1 and nt 3 1 8-943 of SEQ ID N0:4, and wherein said spacer 
sequence is an intron; or 
5 (b) said chimeric forward fragment comprises nucleotides (nt) 254- 

855 of SEQ ID NO: 1 and nt 1 -626 of SEQ ID NO: 1 9, and wherein said spacer 
sequence is an intron. 

38. A method for altering the A/^glycosylation pattern of a heterologous 
10 polypeptide produced in a duckweed plant or duckweed plant cell or nodule, said 

method comprising: 

(a) inhibiting expression or function of at least one a P 1 ,2- 
xylosyltransferase (XylT) in said duckweed plant or duckweed plant cell or nodule, 
wherein said XyltT comprises an amino acid sequence having at least 90% sequence 

1 5 identity to the sequence set forth in SEQ ID N0:6 or SEQ ID N0:2 1 ; and 

(b) culturing said duckweed plant or duckweed plant cell or nodule 
under conditions suitable for expression of said heterologous polypeptide. 



39. The method of claim 38, wherein said XylT comprises an amino acid 
20 sequence having at least 95% sequence identity to the sequence set forth in SEQ ID 

N0:6 or SEQIDN0:21. 



40. The method of claim 39, wherein said XylT comprises the amino acid 
sequence set forth in SEQ ID N0:6 or SEQ ID N0:21. 

25 

41. The method of claim 38, wherein the expression or function of said 
XylT is inhibited by a method selected from the group consisting of: 

(a) introducing a polynucleotide or polypeptide into said duckweed 
plant or duckweed plant cell or nodule, wherein said polynucleotide or polypeptide 

30 inhibits expression or function of said XylT in said a duckweed plant or duckweed 

plant cell or nodule; 

(b) eliminating a gene in said duckweed plant or duckweed plant 

cell or nodule, wherein said gene encodes said XylT; and 
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(c) mutating a gene in said duckweed plant or duckweed plant cell 
or nodule, wherein said gene encodes said XylT. 

42. The method of claim 41, comprising introducing into said duckweed 

5 plant or duckweed plant cell or nodule a nucleotide construct comprising a nucleotide 

sequence that is capable of inhibiting expression or function of said XylT in said 
duckweed plant or duckweed plant cell or nodule, wherein said nucleotide sequence 
that is capable of inhibiting expression or function of said XylT is operably linked to a 
promoter that is functional in a plant cell. 

10 

43. The method of claim 42, wherein said nucleotide sequence that is 
capable of inhibiting expression or function of said XylT comprises a sequence 
selected from the group consisting of: 

(a) the nucleotide sequence set forth in SEQ ID N0:4, SEQ ID 
1 5 NO: 1 9, or a complement thereof; 

(b) the nucleotide sequence set forth in SEQ ID N0:5, SEQ ID 
NO: 20, or a complement thereof; 

(c) a nucleotide sequence having at least 90% sequence identity to 
the sequence of preceding item (a) or (b); and 

20 (d) a fragment of the nucleotide sequence of any one of preceding 

items (a) through (c), wherein said fragment comprises at least 75 contiguous 
nucleotides of said nucleotide sequence. 



44. The method of claim 42, wherein said nucleotide sequence that is 
25 capable of inhibiting expression or function of said XylT comprises in the 5'-to-3' 

orientation and operably linked: 

(a) a XylT forward fragment, said XylT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 
sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 

30 nucleotides of SEQ ID N0:4, SEQ ID N0:5, SEQ TD NO: 1 9, or SEQ ID N0:2 1 ; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 
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(c) and a XylT reverse fragment, said XylT reverse fragment 
having sufficient length and sufficient complementarity to said XylT forward 
fragment such that said nucleotide sequence that is capable of inhibiting expression or 
function of said XylT is transcribed as an RNA molecule capable of forming a hairpin 
RNA structure. 

45. The method of claim 44, wherein said XylT reverse fragment 
comprises the complement of said XylT forward fragment or a sequence having at 
least 90% sequence identity to the complement of said XylT forward fragment. 



46. The method of claim 44 or 45, wherein: 

(a) said XylT forward fragment comprises nucleotides (nt) 3 1 8- 
1052 of SEQ ID N0:4 and wherein said spacer sequence is an intron; or 

(b) said XylT forward fragment comprises nt 1-734 of SEQ ID 
1 5 NO: 1 9 and wherein said spacer sequence is an intron. 

47. The method of anyone of claims 1 5 through 46, wherein said 
heterologous polypeptide is a mammalian polypeptide. 

20 48. The method of claim 47, wherein said mammalian polypeptide is a 

monoclonal antibody. 

49. The method of claim 47, wherein said mammalian polypeptide is 
selected from the group consisting of an interferon, erythropoietin (EPO), tissue 

25 plasminogen activator (tPA), plasminogen, blood coagulation factors, granulocyte- 

macrophage colony stimulating factor (GM-CSF), and therapeutic immunoglobulins. 

50. A nucleotide construct comprising a first nucleotide sequence that is 
capable of inhibiting expression or function of an al,3-fucosyltransferase (FucT) in a 

30 plant, and a second nucleotide sequence that is capable of inhibiting expression or 

function of a pi,2-xylosyltransferase (XylT) in a plant, wherein said first nucleotide 
sequence and said second nucleotide sequence are operably linked to at least one 
promoter that is functional in a plant cell. 
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51. The nucleotide construct of claim 50, wherein said first nucleotide 
sequence is operably linked to a first promoter, and wherein said second nucleotide 
sequence is operably linked to a second promoter. 

5 

52. The nucleotide construct of claim 51, wherein said FucT is a 
polypeptide comprising an amino acid sequence selected from the group consisting 
of: 

(a) the amino acid sequence set forth in SEQ ID N0:3; and 
10 (b) an amino acid sequence having at least 90% sequence identity 

to the amino acid sequence set forth in SEQ ID N0:3, wherein said polypeptide has 
FucT activity. 



53. The nucleotide construct of claim 52, wherein said first nucleotide 
15 sequence comprises a sequence selected from the group consisting of: 

(a) the nucleotide sequence set forth in SEQ ID NO: 1 or a 
complement thereof; 

(b) the nucleotide sequence set forth in SEQ ID N0:2 or a 
complement thereof; 

20 (c) a nucleotide sequence having at least 90% sequence identity to 

the sequence of preceding item (a) or (b); and 

(d) a fragment of the nucleotide sequence of any one of preceding 
items (a) through (c), wherein said fragment comprises at least 75 contiguous 
nucleotides of said nucleotide sequence. 

25 

54. The nucleotide construct of claim 52, wherein said first nucleotide 
sequence comprises in the 5'-to-3* orientation and operably linked: 

(a) a FucT forward fragment, said FucT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 

30 sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 

nucleotides of SEQ ID N0:1 or SEQ ID N0:2; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 
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(c) and a FucT reverse fragment, said FucT reverse fragjnent 
having sufficient length and sufficient complementarity to said FucT forward 
fragment such that said first nucleotide sequence is transcribed as an RNA molecule 
capable of forming a hairpin RNA structure. 

5 

55. The nucleotide construct of claim 54, wherein said FucT reverse 
fragment comprises the complement of said FucT forward fragment or a sequence 
having at least 90% sequence identity to the complement of said FucT forward 
fragment. 

10 

56. The nucleotide construct of claim 54 or 55, wherein said FucT forward 
fragment comprises nucleotides (nt) 255-985 of SEQ ID N0:1. 



57. The nucleotide construct of any one claims 54 through 56, wherein 
1 5 said spacer sequence of said first nucleotide sequence comprises an intron. 



58. The nucleotide construct of any one of claims 52 through 57, wherein 
said XylT is a polypeptide comprising an amino acid sequence selected from the 
group consisting of: 

20 (a) the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID 

N0:21;and 

(b) an amino acid sequence having at least 90% sequence identity 
to the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID N0:21, wherein said 
polypeptide has XylT activity. 

25 

59. The nucleotide construct of claim 58, wherein said second nucleotide 
sequence comprises a sequence selected from the group consisting of: 

(a) the nucleotide sequence set forth in SEQ ID N0:4 or SEQ ID 
NO: 19, or a complement thereof; 
30 (b) the nucleotide sequence set forth in SEQ ID N0:5 or SEQ ID 

NO:20, or a complement thereof; 

(c) a nucleotide sequence having at least 90% sequence identity to 
the sequence of preceding item (a) or (b); and 
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(d) a fragment of the nucleotide sequence of any one of preceding 
items (a) through (c), wherein said fragment comprises at least 75 contiguous 
nucleotides of said nucleotide sequence. 



S 60. The nucleotide construct of claim 58, wherein said second nucleotide 

sequence comprises in the 5'-to-3' orientation and operably linked: 

(a) a XylT forward fragment, said XylT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 
sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 

10 nucleotides of SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 1 9, or SEQ ID NO:20; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 

(c) and a XylT reverse fragment, said XylT reverse fragment 
having sufficient length and sufficient complementarity to said XylT forward 

15 fragment such that said second nucleotide sequence is transcribed as an RNA 

molecule capable of forming a hairpin RNA structure. 



61. The nucleotide construct of claim 60, wherein said XylT reverse 
fragment comprises the complement of said XylT forward fragment or a sequence 

20 having at least 90% sequence identity to the complement of said XylT forward 

fragment. 

62, The nucleotide construct of claim 60 or 61, wherein said XylT forward 
fragment comprises nucleotides (nt) 318-1052 of SEQ ID N0:4 or nt 1-734 of SEQ 

25 ID NO: 19. 



63. The nucleotide construct of any one of claims 60 through 62, wherein 
said spacer sequence within said second nucleotide sequence comprises an intron. 

30 64. A nucleotide construct comprising a fusion polynucleotide that is 

capable of inhibiting expression or function of an al,3-fucosyltransferase (FucT) and 
a pi,2-xylosyltransferase (XylT) in a plant, wherein said fusion polynucleotide is 
operably linked to a promoter that is functional in a plant cell. 
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65. The nucleotide construct of claim 64, wherein said FucT is a 
polypeptide comprising an amino acid sequence selected from the group consisting 
of: 

5 (a) the amino acid sequence set forth in SEQ ID N0:3; and 

(b) an amino acid sequence having at least 90% sequence identity 
to the amino acid sequence set forth in SEQ ID N0:3, wherein said polypeptide has 
FucT activity. 

10 66. The nucleotide construct of claim 64 or 65, wherein said XylT is a 

polypeptide comprising an amino acid sequence selected from the group consisting 
of: 

(a) the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID 

N0:21;and 

1 5 (b) an amino acid sequence having at least 90% sequence identity 

to the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID N0:2 1 , wherein said 
polypeptide has XylT activity. 

67. The nucleotide construct of any one of claims 64 through 66. wherein 
20 said fusion polynucleotide comprises in the 5 -to-3' orientation and operably linked: 

(a) a chimeric forward fragment, said chimeric forward fragment. 

comprising: 

(i) a first fragment comprising about 500 to about 650 
contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 

25 of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 

FucT; and 

(ii) a second fragment comprising about 500 to about 650 
contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 
of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 

30 XylT; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 
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(c) and a reverse fragment, said reverse fragment having sufficient 
length and sufficient complementarity to said chimeric forward fragment such that 
said fusion polynucleotide is transcribed as an RNA molecule capable of forming a 
hairpin RNA structure. 

5 

68. The nucleotide construct of claim 67, wherein said reverse fragment 
comprises the complement of said chimeric forward fragment or a sequence having at 
least 90% sequence identity to the complement of said chimeric forward fragment. 

10 69. The nucleotide construct of claim 67 or 68, wherein said spacer 

sequence comprises an intron. 



70. The nucleotide construct of claim 67, wherein said first fragment 
comprises about 500 to about 650 contiguous nucleotides having at least 90% 

1 5 sequence identity to a nucleotide sequence of about 500 to about 650 contiguous 

nucleotides of SEQ ID NO: 1 ; and said second fragment comprises about 500 to about 
650 contiguous nucleotides having at least 90% sequence identity to a nucleotide 
sequence of about 500 to about 650 contiguous nucleotides of SEQ ID N0:4 or SEQ 
ID NO: 19. 

20 

7 1 . The nucleotide construct of claim 70, wherein said reverse fragment 
comprises the complement of said chimeric forward fragment or a sequence having at 
least 90% sequence identity to the complement of said chimeric forward fragment. 

25 72. The nucleotide construct of claim 70 or 7 1 , wherein said chimeric 

forward fragment comprises: 

(a) nucleotides (nt) 254-855 of SEQ ID NO: 1 and nt 3 1 8-943 of 
SEQ ID N0:4; or 

(b) nucleotides (nt) 254-855 of SEQ ID NO: 1 and nt 1-626 of SEQ 

30 IDN0:19. 

73. The nucleotide construct of any one of claims 70 through 72, wherein 
said spacer sequence comprises an intron. 
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74. The nucleotide construct of any one of claims 64 through 66, wherein 
said fusion polynucleotide comprises in the 5'-to-3' orientation and operably linked: 

(a) a chimeric forward fragment, said chimeric forward fragment 

5 comprising: 

(i) a first fragment comprising about 500 to about 650 
contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 
of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 
XylT; and 

10 (ii) a second fragment comprising about 500 to about 650 

contiguous nucleotides having at least 90% sequence identity to a nucleotide sequence 
of about 500 to about 650 contiguous nucleotides of a polynucleotide encoding said 
FucT; 

(b) a spacer sequence comprising about 200 to about 700 

15 nucleotides; 

(c) and a reverse fragment, said reverse fragment having sufficient 
length and sufficient complementarity to said chimeric forward fragment such that 
said fusion polynucleotide is transcribed as an RNA molecule capable of forming a 
hairpin RNA structure. 

20 

75. The nucleotide construct of claim 74, wherein said reverse fragment 
comprises the complement of said chimeric forward fragment or a sequence having at 
least 90% sequence identity to the complement of said chimeric forward fragment. 

25 76. The nucleotide construct of claim 74 or 75, wherein said spacer 

sequence comprises an intron. 



77, The nucleotide construct of claim 74, wherein said first fragment 
comprises about 500 to about 650 contiguous nucleotides having at least 90% 
30 sequence identity to a nucleotide sequence of about 500 to about 650 contiguous 

nucleotides of SEQ ID N0:4 or SEQ ID NO: 19; and said second fragment comprises 
about 500 to about 650 contiguous nucleotides having at least 90% sequence identity 
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to a nucleotide sequence of about 500 to about 650 contiguous nucleotides of SEQ ID 
N0:1. 

78. The nucleotide construct of claim 77, wherein said reverse fragment 
comprises the complement of said chimeric forward fragment or a sequence having at 
least 90% sequence identity to the complement of said chimeric forward fragment. 

79. The nucleotide construct of claim 77 or claim 78, wherein said spacer 
sequence comprises an intron. 



80. A nucleotide construct comprising a first polynucleotide sequence that 
is capable of inhibiting expression or function of an al,3-fucosyltransferase (FucT) in 
a plant, wherein said first polynucleotide sequence comprises in the 5'-to-3' 
orientation and operably linked: 

15 (a) a FucT forward fragment, said FucT forward fragment 

comprising about 500 to about 800 contiguous nucleotides having at least 90% 
sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 
nucleotides of a polynucleotide encoding said FucT; 

(b) a spacer sequence comprising about 200 to about 700 

20 nucleotides; 

(c) and a FucT reverse fragment, said FucT reverse fragment 
having sufficient length and sufficient complementarity to said FucT forward 
fragment such that said first polynucleotide sequence is transcribed as an RNA 
molecule capable of forming a hairpin RNA structure; 

25 wherein said first polynucleotide sequence is operably linked to a promoter that is 

functional in a plant cell. 

8 1 . The nucleotide construct of claim 80, wherein said FucT reverse 
fragment comprises the complement of said FucT forward fragment or a sequence 

30 having at least 90% sequence identity to the complement of said FucT forward 

fragment. 
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82. The nucleotide construct of claim 80 or 8 1 , wherein said spacer 
sequence of said first polynucleotide comprises an intron. 

83. The nucleotide construct of claim 80, wherein said FucT is a 

5 polypeptide comprising an amino acid sequence selected from the group consisting 

of: 

(a) the amino acid sequence set forth in SEQ ID N0:3; and 

(b) an amino acid sequence having at least 90% sequence identity 
to the amino acid sequence set forth in SEQ ID N0:3, wherein said polypeptide has 

10 FucT activity. 

84. The nucleotide construct of claim 83, wherein said FucT forward 
fragment comprises about 500 to about 800 contiguous nucleotides having at least 
90% sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 

1 5 nucleotides of SEQ ID NO: 1 or SEQ ID N0:2. 

85. The nucleotide construct of claim 84, wherein said FucT reverse 
fragment comprises the complement of said FucT forward fragment or a sequence 
having at least 90% sequence identity to the complement of said FucT forward 

20 fragment. 

86. The nucleotide construct of claim 84 or 85, wherein said FucT forward 
fragment comprises nucleotides (nt) 255-985 of SEQ ID NO: 1 . 

25 87. The nucleotide construct of any one claims 84 through 86, wherein 

said spacer sequence of said first polynucleotide sequence comprises an intron. 

88. A nucleotide constmct comprising a first polynucleotide sequence that 
is capable of inhibiting expression or function of a pi,2-xylosyltransferase (XylT) in a 
30 plant, wherein said first polynucleotide sequence comprises in the 5 -tO'3' orientation 

and operably linked: 

(a) a XylT forward fragment, said XylT forward fragment 
comprising about 500 to about 800 contiguous nucleotides having at least 90% 
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sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 
nucleotides of a polynucleotide encoding said XylT; 

(b) a spacer sequence comprising about 200 to about 700 

nucleotides; 

5 (c) and a XylT reverse fragment, said XylT reverse fragment 

having sufficient length and sufficient complementarity to said XylT forward 
fragment such that said first polynucleotide sequence is transcribed as an RNA 
molecule capable of forming a hairpin RNA structure; 

wherein said first polynucleotide sequence is operably linked to a promoter that is 
1 0 functional in a plant cell. 



89. The nucleotide construct of claim 88, wherein said XylT reverse 
fragment comprises the complement of said XylT forward fragment or a sequence 
having at least 90% sequence identity to the complement of said XylT forward 

15 fragment. 

90. The nucleotide construct of claim 88 or 89, wherein said spacer 
sequence within said first polynucleotide sequence comprises an intron. 

20 91. The nucleotide construct of claim 88, wherein said XylT is a 

polypeptide comprising an amino acid sequence selected from the group consisting 
of: 

(a) the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID 

N0:21;and 

25 (b) an amino acid sequence having at least 90% sequence identity 

to the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID N0:21, wherein said 
polypeptide has XylT activity. 

92. The nucleotide construct of claim 91, wherein said XylT forward 
30 fragment comprises about 500 to about 800 contiguous nucleotides having at least 

90% sequence identity to a nucleotide sequence of about 500 to about 800 contiguous 
nucleotides of SEQ ID N0:4. SEQ ID N0:5, SEQ ID NO: 1 9. or SEQ ID NO:20. 
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93. The nucleotide construct of claim 92, wherein said XylT reverse 
fragment comprises the complement of said XylT forward fragment or a sequence 
having at least 90% sequence identity to the complement of said XylT forward 
fragment. 

5 

94. The nucleotide construct of claim 92 or 93, wherein said XylT forward 
fragment comprises nucleotides nt 318-1052 of SEQ ID N0:4 or nt 1-734 of SEQ ID 
NO: 19. 

10 95. The nucleotide construct of any one claims 92 through 94, wherein 

said spacer sequence within said first polynucleotide sequence comprises an intron. 

96. The nucleotide construct of any one of claims 50 through 95, further 
comprising at least one polynucleotide encoding a polypeptide of interest, wherein 

1 5 said polynucleotide encoding said polypeptide of interest is operably linked to a 

promoter that is functional in a plant cell. 

97. The nucleotide construct of claim 96, wherein said polypeptide of 
interest is a mammalian polypeptide. 

20 

98. The nucleotide construct of claim 97, wherein said mammalian 
polypeptide is a monoclonal antibody. 

99. The nucleotide construct of claim 97, wherein said manunalian 
25 polypeptide is selected from the group consisting of an interferon, erythropoietin 

(EPO), tissue plasminogen activator (tPA), plasminogen, blood coagulation factors, 
granulocyte-macrophage colony stimulating factor (GM-CSF), and therapeutic 
immunoglobulins. 

30 1 00. A vector comprising a nucleotide construct according to any one of 

claims 50 through 99. 
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101 . A plant or plant cell comprising a nucleotide construct according to 
any one of claims 50 through 99 stably integrated into its genome, 

102. The plant or plant cell of claim 101 , wherein said plant is a monocot or 
5 said plant cell is from a monocot. 

103. The plant or plant cell of claim 102, wherein said monocot is a member 
of the Lemnaceae, 

10 104. The plant or plant cell of claim 103, wherein said monocot is from a 

genus selected from the group consisting of the genus Spirodela, genus Wolffia, genus 
Wolflella, genus Landoltia, and genus Lemna. 

105. The plant or plant cell of claim 104, wherein said monocot is a member 
15 of a species selected from the group consisting of Lemna minor, Lemna miniscula, 

Lemna aequinoctialis, and Lemna gibba. 

106. The plant or plant cell of claim 101, wherein said plant is a dicot or 
said plant cell is from a dicot. 

20 

1 07. A method for stably transforming a higher plant to express 
glycoproteins having an altered A^-glycosylation pattern, said method comprising 
introducing into said higher plant the nucleotide construct according to any one of 
claims 50 through 95. 

25 

108. The method of claim 107, wherein said higher plant is a plant host 
expressing at least one heterologous polypeptide of interest. 

109. The method of claim 108, wherein said heterologous polypeptide of 
30 interest is a mammalian polypeptide or biologically active variant thereof. 

1 10. The method of claim 109, wherein said mammalian polypeptide is a 
monoclonal antibody. 



- 166- 



wo 2007/084922 



PCTAJS2007/060642 



111. The method of claim 109, wherein said mammalian polypeptide is 
selected from the group consisting of an interferon, erythropoietin (EPO), tissue 
plasminogen activator (tPA), plasminogen, blood coagulation factors, granulocyte- 

S macrophage colony stimulating factor (GM-CSF), and therapeutic immunoglobulins. 

112. A method for stably transforming a higher plant to express a 
heterologous polypeptide of interest having an altered A/-glycosylation pattern, said 
method comprising introducing into said higher plant the nucleotide construct 

10 according to any one of claims 96 through 99. 

113. The method of claim 1 12, wherein the A^-glycosylation pattern is 
characterized by a reduction in the attachment of al,3-fucose residues to iV-glycans 
attached to said heterologous polypeptide. 

15 

1 14. The method of claim 1 12, wherein the //-glycosylation pattern is 
characterized by a reduction in the attachment of pi,2-xylose residues to //-glycans 
attached to said heterologous polypeptide. 

20 115. The method of claim 1 12, wherein the A^glycosylation pattern is 

characterized by a reduction in the attachment of al,3-fucose residues and pi,2- 
xylose residues to A^-glycans attached to said heterologous polypeptide. 

116. A method for producing a heterologous mammalian glycoprotein in a 
25 higher plant, wherein said heterologous manunalian glycoprotein has an a reduction in 

the attachment of al,3-fucose and pi,2-xylose residues to A^-glycans of said 
glycoprotein when produced in said higher plant, said method comprising: 

(a) introducing into said plant an expression cassette comprising a 
sequence encoding a mammalian polypeptide that is post-translationally processed as 

30 said glycoprotein, and a polynucleotide comprising the nucleotide construct of any 

one of claims 50 to 79; and 

(b) culturing said plant under conditions suitable for expression of 
said glycoprotein. 
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1 17. The method of claim 1 16, wherein said mammalian polypeptide is a 
monoclonal antibody. 

5 1 18. The method of claim 1 16, wherein said mammalian polypeptide is 

selected from the group consisting of an interferon, erythropoietin (EPO), tissue 
plasminogen activator (tPA), plasminogen, blood coagulation factors, granulocyte- 
macrophage colony stimulating factor (GM-CSF), and therapeutic immunoglobulins. 

10 119. The method of any one of claims 107 through 1 1 8, wherein said higher 

plant is a monocot. 

120. The method of claim 1 19, wherein said monocot is a member of the 
Lemnaceae, 

15 

121. The method of claim 120, wherein said monocot is from a genus 
selected from the group consisting of the genus Spirodela, genus Wolffia, genus 
Wolfiella, genus Landoltia, and genus Lemna. 

20 122. The method of claim 121, wherein said monocot is a member of a 

species selected from the group consisting of Lemna minor, Lemna miniscula, Lemna 
aequinoctialis, and Lemna gibba, 

123. The method of any one of claims 107 through 118, wherein said higher 
25 plant is a dicot. 

1 24. A method for reducing heterogeneity of the iV-glycosylation profile of 
a glycoprotein produced in a higher plant, said method comprising introducing into 
said plant the nucleotide construct according to any one of claims 50 through 99, and 

30 culturing said plant under conditions suitable for expression of said glycoprotein. 

125. The method of claim 124, wherein said glycoprotein is an endogenous 
glycoprotein. 
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126. The method of claim 124, wherein said glycoprotein is a heterologous 
glycoprotein. 

127. The method of claim 126, wherein said heterologous glycoprotein is a 
5 manunalian glycoprotein. 

128. The method of claim 127, wherein said mammalian glycoprotein is a 
monoclonal antibody. 



10 129. The method of claim 127, wherein said manunalian glycoprotein is 

selected from the group consisting of an interferon, erythropoietin (EPO), tissue 
plasminogen activator (tPA), plasminogen, blood coagulation factors, granulocyte- 
macrophage colony stimulating factor (GM-CSF), and therapeutic immunoglobulins. 

15 1 30. The method according to any one of claims 1 24 through 1 32, wherein 

said higher plant is a monocot. 

131. The method of claim 1 30, wherein said monocot is a member of the 
Lemnaceae. 

20 

1 32. The method of claim 131, wherein said monocot is from a genus 
selected from the group consisting of the genus Spirodela, genus Wolffla, genus 
Wolfiella, genus Landoltia, and genus Lemna. 

25 133. The method of claim 1 32, wherein said monocot is a member of a 

species selected from the group consisting of Lemna minor, Lemna miniscula, Lemna 
aequinoctialis, and Lemna gibba. 

134. The method according to any one of claims 131 through 133, wherein 
30 the reduced heterogeneity of said AT-glycosylation profile is maintained with scale-up 

in production of said plant, wherein production scale is increased by at least 6,500- 
fold. 
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135. The method according to any one of claims 13 1 through 134, wherein 
the reduced heterogeneity of said A/-glycosylation profile is maintained with 
continuous clonal culture of said plant. 

5 136. The method of claim 1 35, wherein the reduced heterogeneity of said N- 

glycosylation profile is maintained with continuous clonal culture of said plant for at 
* least 8 months. 

137. The method according to any one of claims 124 through 132, wherein 
1 0 said higher plant is a dicot. 

138. An isolated polynucleotide comprising a nucleotide sequence selected 
from the group consisting of: 

(a) the nucleotide sequence set forth in SEQ ID NO: 1 or SEQ ID 

15 N0:2; 

(b) the nucleotide sequence set forth in SEQ ID N0:4, SEQ ID 
N0:5; SEQ ID NO: 19, or SEQ ID NO:20; 

(c) a nucleotide sequence encoding a polypeptide comprising the 



20 



25 



30 



lino acid sequence set forth in SEQ ID N0:3, SEQ ID N0:6; or SEQ ID N0:21 ; 

(d) a nucleotide sequence comprising at least 90% sequence 
identity to the sequence set forth in SEQ ID N0:1 or SEQ ID N0:2, wherein said 
polynucleotide encodes a polypeptide having al,3-fucosyltransferase (FucT) activity; 

(e) a nucleotide sequence comprising at least 90% sequence 
identity to the sequence set forth in SEQ ID N0:4, SEQ ID N0:5, SEQ ID NO: 1 9. or 
SEQ ID NO:20, wherein said polynucleotide encodes a polypeptide having pi,2- 
xylosyltransferase (XylT) activity; 



(f) a nucleotide sequence comprising at least 15 contiguous 
nucleotides of SEQ ID NO: 1 , SEQ ID N0:2, SEQ ID N0:4, SEQ ID N0:5, SEQ ID 
NO: 19, or SEQ ID NO:20, or a complement thereof; 

(g) a nucleotide sequence comprising at least 19 contiguous 
nucleotides having at least 90% sequence identity to a nucleotide sequence 
comprising at least 19 contiguous nucleotides of SEQ ID NO: 1, SEQ ID N0:2, SEQ 
ID N0:4. SEQ ID N0:5, SEQ ID NO: 19, or SEQ ID NO:20 and a complement 



thereof; 
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(h) a nucleotide sequence encoding an amino acid sequence having 
at least 90% sequence identity to the sequence set forth in SEQ ID N0:3. wherein said 
polynucleotide encodes a polypeptide having FucT activity; 

(i) a nucleotide sequence encoding an amino acid sequence having 
5 at least 90% sequence identity to the sequence set forth in SEQ ID NO: 6 or SEQ ID 

N0:21, wherein said polynucleotide encodes a polypeptide having XylT activity; and 

(j) the complement of the nucleotide sequence of any one of 
preceding items (a) through (i). 

10 1 39, An expression cassette comprising the polynucleotide of claim 1 3 8. 

140. The expression cassette of claim 139, wherein said polynucleotide is 
operably linked to a promoter that is functional in a plant cell. 

15 141. A plant or plant cell comprising the expression cassette of claim 1 40. 

142 The plant or plant cell of claim 141, wherein said plant is a monocot or 
said plant cell is from a monocot. 

20 143. The plant or plant cell of claim 142, wherein said monocot is a member 

of the Lemnaceae, 

144. The plant or plant cell of claim 143, wherein said monocot is from a 
genus selected from the group consisting of the genus Spirodela, genus Wolffia, genus 

25 Wolfiella, genus Landohia, and genus Lemna, 

145. The plant or plant cell of claim 144, wherein said monocot is a member 
of a species selected from the group consisting oi Lemna minor, Lemna miniscula, 
Lemna aequinoctialis, and Lemna gibba, 

30 

146. The plant or plant cell of claim 141 , wherein said plant is a dicot or 
said plant cell is from a dicot. 
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147. The plant or plant cell of any one of claims 141 through 146, wherein 
said polynucleotide is stably incorporated into the genome of the plant or plant cell 

148. An isolated polypeptide comprising an amino acid sequence selected 
5 from the group consisting of: 

(a) the amino acid sequence set forth in SEQ ID N0:3; 

(b) an amino acid sequence having at least 90% sequence identity 
to the amino acid sequence set forth in SEQ ID N0:3, wherein said polypeptide has 
al,3-fucosyltransferase (FucT) activity; 

10 (c) an amino acid sequence comprising at least 20 consecutive 

amino acids of SEQ ID N0:3, wherein said polypeptide has FucT activity; 

(d) the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID 

N0:21; 

(e) an amino acid sequence having at least 90% sequence identity 

1 5 to the amino acid sequence set forth in SEQ ID N0:6 or SEQ ID N0:2 1 . wherein said 

polypeptide has pi,2-xylosyltransferase (XylT) activity; and 

(f) an amino acid sequence comprising at least 20 consecutive 
amino acids of SEQ ID N0:6 or SEQ ID N0:21, wherein said polypeptide has XylT 
activity. 

20 

149. A composition comprising a glycoprotein produced in accordance with 
the method of anyone of claims 1 16 through 1 37. 
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DNA and amino acid sequence information 

Lemna minor alpha 1-3 fucosyltransferase: 

Fucosyl Transferase 

3^1 R 





L. minor RACE PGR fucosyltransferase 

1865 bp 

FucT isoform #1 cDNA sequence: (UTR's in standard font, coding region in bold) 

acgcggggggaagtggttgagtagctcagtggaaaattggaaatgtctattagagggggaagaggggagggatccgagggg 
aacgaggaaggtgtgccgaattctcgtagatttcttcaattcctgcagatctcgtcttctctctgatttcttcccgagctc 
cgcccgtaggaactcaatcggactcgatccaagttgacgaggcctacygaggaaggcgattttccgaagccctgcgatcga 
tqqccacctct gctgctggtgctctcaacgccgqtqgcaqggtcqqgqgcagqaggaqttqqqtcaqattgcttcccttct 
ttqtqttgatqctggtggtaqgggagatctggttcctcgggcgqctqqatqtqqtcaaqaacqccqctatqqttcaaaact 
qqacttcctcccacttgtttttcttaccaqtttcttcctacacgtqgtccgaqaccgtcaaggaggaagagqattqcaagg 
actggctqgaaaqaqtaqatqcqqtcgattacaaqaqaqatttccqtqtqqaacccqttctqqtaaatqacqctqaacagg 
attqqaqttcatqttcaqtqgqctqtaaqttcqqatcattccccqqaaqaacqcctqatqctacatttggtttctctcaqa 
atccatcaacaqtcaqtqtccatcqatccatqqaatcatcccattattatttqqaqaataatcttqataatqcacqacqqa 
aaggctatcaaattgtgatgacaactagtctcttgtcaqatqtgcctgtcgqttatttctcatgggctqaatatqatatca 
tgqcqcctcttcaqccgaaaactgctqqtqcacttqctqctqcatttatatctaattqcqgaqcacgtaatttccgcttgc 
agqcccttqatatgctcqaaaaqtcgaatattaaqattgattcatatggtqcttqccatcgcaaccaagacqqtaaaqtqq 
acaaggtacaaactttgaagcggtataagttcagcttagcttttgaaaactcgaacgaggatgactatgttactgagaagt 
tctttcaatctcttqtcactggagctattcctqttqtcgtcqgagcccccaacattcaaaattttQcaccatcttctqatt 
caattctQcacatcaQqaaacccaaqQatqtcaqttcaqtcqctqaaaaaataaaatttctcqcttcaaatccaqaaqcat 
ataaccaatcactaaQatQqaaQtttqaaQacccttctaactccttcaaaQccctqataaacatagcaacagttcactcct 
cctqccqcctatqGattcacattgccaccaagatcaqaqaqaaggaaq'aqaaaaaccccraatttcaagactcgcccttaca 
aQtacacGGqGaataaatctaGcttatatcaGttatacqcccg^cqaaacyac^acacctttaa cttcttatcaatcttcataa 
gatcggataatctatcactgaaagcgctggggtcaacagttcttgagaaattcagttctttgaagcacgtgccgatttgga 
agaaggagaggccagagagtctgaaaggagggagcaagctggatctttacagaatctatccagtgggcattactcagagag 
aagctctcttctctttccagttcaacactgacaaagaaGttcaaatctaccttgaatcccatccatgtgcgaagtttgaag 
tcatctttatttgatccctgaggtaattaggtcacgaattcagctaatttggttaattatgcttcaagcccacatggtatt 
tcatatcattaattgaaggcatagttagttgatattgacattttcgtctaggatcattctaaagtctatcccaatgaactt 
aa 

FucT isoform #1 predicted amino acid sequence: 

matsaagalnaggrvggrrswvrllpf f vlmlvvgeiwf Igridvvknaamvqnwtsshlf f Ipvssytwsetvkeeedck 
dwlervdavdykrdf rvepvlvndaeqdwsscsvgckfgsfpgrtpdatf gf sqnpstvsvhrsmesshyylennldnarr 
kgyqivmttsllsdvpvgyf swaeydimaplqpktagalaaaf isncgarnf riqaldmleksnikidsygachrnqdgkv 
dkvqtikrykf slaf ensneddyvtekf f qslvagaipwvgapniqnf apssdsilhirepkdvssvaermkf lasnpea 
ynqslrwkf egpsnsf kalvdmaavhsscrlcihiatkirekeernpnf ktrpckctrngstlyhlyarergtfdf Isif m 
rsdnlslkalgstvlekf sslkhvpiwkkerpeslkggskldiyriypvgitqreailf sfqf ntdkeiqiyleshpcakfe 
vif i* 



FIG. 1 



wo 2007/084922 



PCT/US2007/060642 



2/59 

Amino acid sequence alignment of alpha 1^ fucosyltransferase from Lemna minor md other 
higher plants 

Total alignment positive - 89.2% 
Total alignment identity -41.1% 

Triticum aestivum: Genbank accession# AJ582 182,1 
Hordeum vulgare: Genbank accession# AJ582 181.1 
Oryza sativa: Genbank accession# XM_482907.1 
Medicago sativa: Genbank accession# AY082444.1 
Arabidopsis thaliana: Genbank accession# AYl 84990.1 



Lemna minor FucT (1 

Triticum aestivum FucT (1 

Hordeum vulgare FucT (1 

Oryza sativa FucT (1 

Medicago sativa FucT (1 

Arabidopsis thaliana FucT (1 

Consensus (1 



Lemna minor FucT (17 

Triticum aestivum FucT (13 

Hordeum vulgare FucT (12 

Oryza sativa FucT (18 

Medicago sativa FucT (27 

Arabidopsis thaliana FucT (36 

Consensus (36 



Lemna minor FucT (52 

Triticum aestivum FucT (48 

Hordeum vulgare FucT (47 

Oryza sativa FucT (53 

Medicago sativa FucT (62 

Arabidopsis thaliana FucT (71 

Consensus (71 



Lemna minor FucT (76 

Triticum aestivum FucT (69 

Hordeum vulgare FucT (68 

Oryza sativa FucT (88 

Medicago sativa FucT (81 

Arabidopsis thaliana FucT (87 

Consensus (106 
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Lemna minor FucT 
Triticum aestivum FucT 
Hordeum vulgare FucT 
Oryza sativa FucT 
Medicago sativa FucT 
Arabidopsis thaliana FucT 

Consensus 
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Lemna minor beta 1-2 xylosyltransferase: 



XylT 



51ITR 



3*UTR 




Lemna minor beta 1-2 xj^osvltransf erase 

1860 ip 



XylT isoform #1 cDNA sequence: (UTR's in standard font, coding region in bold) 

ggcttccaaccggaggatctcgagctgaagaatcttcatgactgaagaattcatgtgatcccatggctttggtgaactcgc 
gagggagcagggtcagacgcatcgcgaagcccaccttcgttttcctcttgatcaacgtagtctgtctcctgtactttttcc 
gtcagaaccctaatcccattcccgacgcttgtcttcacggggaatgcgacaaacccccgattttagtgactccccggcgat 
ggaacttgaagccatggccgattcttccttcctttctgccatgggtgccgagctcccaccctgcccagggctcctgcgaag 
cctacttcggcaacagcttcaaccgccqgacqqaqatgctgaagaaqgtagaggqaagagqatggttccagtqcctqtaca 
qcqatactcttcqaaqttctqtttqccaqqqaqqqaatttqcqqatqqacccqqaaaqqattaqqatqtcqaaaqgqqqqg 
aaqatctaqaqqaqqtqatqaaqagaqaqqaqqaaqaaqaattqcccaaattcqaqqaqqqqtcqttccaqattqaatctq 
qttatqqaaqcqqaqqqqaaqttqqaqaqaqaattqcqactqacqaqqtcctcgataatqttqtqccqaaaqqcqctgttc 
atqtacataccatqcqcaatctcatcaqttcqattcaqattqttqqtcccqggcatcttcaatgctctcaqtqqatcgacg 
aaccqqttcttcttgtcacacqcttcqaatacqccaatctctttcacaccqtcaccqactqqtacaqcqcctacqcaaqct 
cgaqqattqccaacttqccttctcgccctcacttaattttcqtcqatqqccattqcaqqqcqqaacaqttaqaqqacatqt 
qgaqaqccctqttctcqaccgtccgatactccaagaacttctcccaqccaatctqcttccqccacgtcqtcctctcacctc 
tgqqctatqaqacqqctctcttcaaaqqcctatcaqaqagcttcagctgtgagggagctccggccaatcggctcaaagtca 
accccaatgacq^ga^aactacaaaactgqctqaattcqaaqagataatcagaqccqcctttaactttcctqtcattqacc 
cgtccattgacccgttgaccaaatccatcctcttcgtgcggcgggaagattacgtggcgcacccacgccacagtgggagag 
tqqaatcqcqactaaccaacaaqcaaaaaqtqtttqactttctqcacaattqqQcaaatcatcacaqagqcaqqtacaaca 
tcaqtatggtcaacgggcttttcgcgcacatgggaiatgaaggaacagttgaaggcgattatggaagcttcggtggtggtgg 
qqgcccacgqqqctqqtttgacccatctggtggcagcaaggtcaacgacagttgttcttgagattctgagtaqtcaatacc 
gtagaccgcactttcaactgatttctcggtggaaagggttggactaccatgcaattaatcttgccgggtcatttgctgacc 
ctcgggaggtggtcgagaaattgactggcatagttgacaggcttggatgttgaaqaaaaq tqaaaatcaacatttagaatt 
ttaactttaaggggtggttaacaattgagcggcattgtcaacgggtttggatgctgggaaaagtgaaaatcaacacttgga 
gttctgacattgaaggcaagacgtggaattttgatggtgttgaggatatttggatgtggagttctgatgaattaaagcagg 
ggttgatcatttgccagtggaattatgttggtgtaagagagaagggggagaataaacagtgttagagagctatgctgg 



XylT isoform #1 predicted amino acid sequence: 

malvnsrgsrvrriakptfvf llinvvcllyf f rqnpnpipdaclhgecdkppilvtprrwnlkpwpilpsf Ipwvpsshp 
aqgsceayfgnsf nrrtemlkkvegrgwfqclysdtlrssvcqggnlrmdperirmskggedleevmkreeeeelplcf eeg 
sfqiesgygsggevgeriatdevldnvvpkgavhvhtmrnlissiqivgpghlqcsqwidepvllvtrf eyanlfhtvtdw 
ysayassrianlpsrphlif vdghcraeqledmwralf stvrysknf sqpicf rhvvlsplgyetaif kglsesf scegap 
anrlkvnpddqktarlaef gemiraafdfpvvdpsidpltksilfvrredyvahprhsgrvesrltneqevfdf Ihnwash 
hrgrcnismvnglf ahmgmkeqlkaimeasvvvgahgaglthlvaarsttvvleilssqyrrphf qlisrwkgldyhaini 
agsf adprevvekltgivdrlgc* 
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Amino acid sequence alignment of beta 1,2 xylosyltransferase from Lemna minor and other higher 
plants 



Total alignment positive - 76.6% 
Total alignment identity - 33.7% 



Medicago sativa: Genbank accession# AY30225 1 . 1 
Oryzasativa: Genbank accession# XM_483221.1 
A rabidopsis thaliana : Genbank accession# AJ272 121.1 
Saccharum officinarum: Genbank accession# AJ864704.1 
Nicotiana tabacum: Genbank accession# AJ627 1 82. 1 



Lemna minor XylT 
Medicago sativa XylT 
Oryza sativa XylT 
Arabidopsis thaliana XylT 
Saccharum officinarum XylT 
Nicotiana tabacum XylT 

Consensus 



1 35 
1 ) MALVNSRGSRV|||AiPTrVFLL| 

1) M^l^i^iFiii?' 

1 ) MMPVRTYHHHHHHNNSNNH|P^iliP^i|A|i^ 

1) MS^^i^|iiY|Li| 

1) MMAGRAHGHRN ^||Jp^||l^gp 

1) RL RK ILK ILL LFAI 



Lemna minor XylT (25) 

Medicago sativa XylT (15) 

Oryza sativa XylT (36) 

Arabidopsis thaliana XylT (19) 

Saccharum officinarum XylT (28) 

Nicotiana tabacum XylT (16) 

Consensus (36) 



70 

PR 



mY^|RQNP?lPIPDACLHGECDKPPIL^ 
girAA^I^RHQSPH PH PH PAADPERDAVDAAGG 



r1^FLNPTPSPN--l-DIKI 



SfflSiflSSFSPEQSQ— -iPHIYHV|VN 




iI^LQ||?QSESHHQSPPDiTPRTEARDG 

^ s sIIdh frhks pqnh f|n TQNH Y|L S 

NSVsii lYFL HSS P S 



Lemna minor XylT (60) 

Medicago sativa XylT (47) 

Oryza sativa XylT (71) 

Arabidopsis thaliana XylT (51) 

Saccharum officinarum XylT (63) 

Nicotiana tabacum XylT (51) 

Consensus (71) 



71 105 
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ENHHDNF— HS S VT|QYTiiil|iil||||^ - N P 
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Lemna minor XylT (81) 

Medicago sativa XylT (80) 

Oryza sativa XylT (106) 
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FucT RNAi design (single gene KO): 



FucT forward fragment 
1188 bp 



FucT reverse fragment 
730 bp 




• Fucosyltransferase (FucT) RNAi expression cassette was 
designed to express a hairpin RNA with 730 bp of double-stranded 
RNA sequence and a 458 bp spacer sequence. The RNAi 
expression cassette comprises a FucT forward fragment (nt 255- 
985 of SEQ ID N0:1 ), the spacer sequence (nt 986-1444 of SEQ 
ID N0:1), and the FucT reverse fragment (antisense version of nt 
255-985 of SEQ ID N0:1). 
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XylT RNAi design (single gene KO): 



XylT forward fragment 
1281 bp 



XylT reverse fragment 
734 bp 




• Xylosyltransferase (XylT) RNAi expression cassette was 
designed to express a hairpin RNA with 734 bp of double-stranded 
RNA sequence and a 547 bp spacer sequence. The RNAi 
expression cassette comprises the XylT forward fragment (nt 318- 
1052 of SEQ ID N0:4), the spacer sequence (nt 1053-1599 of 
SEQ ID N0:4), and the XylT reverse fragment (antisense version 
of nt 318-1 052 of SEQ ID N0:4). 



FIG. 6 
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FucT + XylT chimeric RNAi design: 



FucT 
forward 
602 bp 



XylT 
forward 
1126 bp 





XylT 
reverse 
626 bp 




FucT 
reverse 
602 bp 




• Chimeric FucT + XylT RNAi expression cassette was designed 
to express a hairpin RNA with 602 bp of double-stranded FucT 
sequence, 626 bp of double-stranded XylT sequence, and a 500 
bp spacer sequence. The RNAi expression cassette comprises in 
the 5' to 3' direction and operably linked: the FucT forward 
fragment (nt 254-855 of SEQ ID N0:1), the XylT forward fragment 
(nt 318-943 of SEQ ID N0:4), a spacer sequence (nt 944-1443 of 
SEQ ID N0:4), the XylT reverse fragment (antisense version of nt 
318-943 of SEQ ID N0:4), and the FucT reverse fragment 
(antisense version of nt 254-855 of SEQ ID N0:1 ). 



FIG. 7 
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XylT RNAi expression cassette 
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FucT / XylT double knockout 
Chimeric RNAi expression cassette 
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FucT / XylT double knockout 
Single gene RNAi expression cassette 
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IgGl co-expression with FucT / XylT 
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IgGl co-expression with FucT / XylT 
double knockout (single gene RNAi) 
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Lemna minor beta 1-2 xylosyltransferase (isoform 2) 



XyTT boform #2 (partial cDNAsequence) 



yUTR 




XylT (isoform 2) used in RNAi constructs 



XylT isoform #2 partial cDNA sequence (coding sequence in bold font, 3^UTR sequence in 
standard font): 

tgc2aagcctacttcggcaacagcttcaaccgccggacggagatgctgaagaaggtagag2gaagaggatggttccagtgcct2taca 

gcgatactcttcgaagttctgtttgccagggagggaa tttgcggatggacccggaaaggattaggatgtcgaaagggggggaagatcta 

gaggaggtgatgaagagagaggaggaagaagaattgcccaaattcgaggaggggtcgttccagattgaatctggttatggaagcgga 

ggggaagttggagagagaattgcgactgacgaggtcctcgataatgttgtgccgaaaggcgctgttcatgtacataccatgcgcaatctc 

atcagttcgattcagattgttggtcccgggcatcttcaatgctctcagtggatcgacgaaccggttcttcttgtcacacgcttcgaatacgcc 

aatctctttcacaccgtcaccgactggtacagcgcctacgcaagctcgaggattgccaacttgccctctcgccctcacttgattttcgtcgat 

ggccattgcagggcagaacagttagaggacacgtggcgagccctgttctcaaccgtccgatacgccaagaacttctcccagccagtctgc 

ttccgccacgccgtcctctcccctcttggctatgagacagctctcttcaaaggcctatcagagagcttcagctgtgagggagtgccggccaa 

tcagctcaaagtcaaccctgatgaccagaagactgcgaqactggctgaattcggagagatgatca gggctgccttt^actttcctgtcgtt 

gacccgcccgttgacccgttgaccaaatccatcctctttgtgcg gcgggaagattacgtggcgcacccacgccacagtgggagagtggag 

tcgcggttgaccaatgagcaap[apgtgtttgactttctgcacaaatgggcaagtc aacacagaa^caggtgcaacgtcagtgtggtcaac 

pggcttttcgcgcacatgggaatpaaggaacaggtgaaggcaattatggaagcttcggtggtggtc^g^^cccacggggctggtttgact 

catctggtggcagcaaggtcaacgacagttgttcttgagattctga^cagtcaatatcgtagaccgcactttcaactgatttca cpptp^aa 

afj;fj;^tt^ gactaccacgcaattaatcttgccgggtcgtatgctgatcctcpggag^tggtcgagaaattpactggcat agtcgatgggctt 

ggatgttgaagataag 

XylT isoform #2 predicted amino acid sequence (partial sequence): 

ceayfgnsfhrtemlkkvegrgwfqclysdtlrssvcqggnlrmdpcrirmskggedleev^ 

geriatdevldnwpkgavhvhtmmlissiqivgpghlqcsqwidepvllvtrfeyanlfhtvtdwysayassrianlpsrph 

qledtwraifstvayaknfsqpvcfrhavlsplgyetalfkglsesfscegvpanqlkvnpddqktarlaefgem 

ilfvrredyvahprhsgrvesrltneqevfdflhkwasqhrsrcnvsvvnglfahmgmkeqvkaime 

eilssqyrrphfqlisrwkgldyhainlagsyadprewekltgivdglgc* 
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LmXylT Isoform#l (1 
LmXylT Isoform #2 (1 
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XylT RN Ai design (single gene KO) I 



• Xylosyltransferase (XylT) RNAi expression cassette was 
designed to express a hairpin RNA with 734 bp of double-stranded 
RNA sequence and a 547 bp spacer sequence, based on the XylT 
isoform #2 partial DNA sequence. The RNAi expression cassette 
comprises the XylT forward fragment (nt 1-734 of SEQ ID NO: 19), 
the spacer sequence (nt 735-1282 of SEQ ID N0:19), and the 
XylT reverse fragment (antisense version of nt 1-734 of SEQ ID 




XylT forward fragment 
1281 bp 



XylT reverse fragment 
734 bp 



N0:19). 



FIG. 33 
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FucT + XylT chimeric RNAi design: 



FucT 
forward 
602 bp 





XylT 
reverse 
626 bp 





• Chimeric FucT + XylT RNAi expression cassette was designed 
to express a hairpin RNA with 602 bp of double-stranded FucT 
sequence, 626 bp of double-stranded XylT sequence, and a 500 
bp spacer sequence. The RNAi cassette encompasses in the 5' 
to 3' direction and operably linked: the FucT fonward fragment (nt 
254-855 of SEQ ID N0:1 ), the XylT forward fragment from isoform 
#2 (nt 1-626 of SEQ ID NO: 19, a spacer sequence (nt 627-1 126 of 
SEQ ID NO: 19), the XylT reverse fragment (antisense version of 
nt 1-626 of SEQ ID NO: 19), and the FucT reverse fragment 
(antisense version of nt 254-855 of SEQ ID N0:1 ). 



FIG. 34 
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MDXA04 Expression Vector 




P1 L T P2 RNAi T P3 H T 



RB 





FucT 
sense 



XylT 
sense 



XylT FucT 
antisense antisense 
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FIG. 40 
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